arxiv:2504.02507
Abhay kumar
akanyaani
AI & ML interests
LLMs, GenAI, Transformers
Recent Activity
liked
a model
about 2 months ago
Fortytwo-Network/Strand-Rust-Coder-14B-v1
upvoted
a
paper
7 months ago
ZClip: Adaptive Spike Mitigation for LLM Pre-Training