Llama baseline checkpoints (0.6B, 1.3B)
Chunyuan Deng
CharlesDDDD
·
AI & ML interests
Architecheture, Interpretability.
Recent Activity
updated
a collection
2 days ago
looped_transformer
updated
a model
2 days ago
CharlesDDDD/looped_transformer_loop_count_4