Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Juan Cervino's picture
1

Juan Cervino

jcervino
·
https://juancervino.github.io/
  • juancervino
  • juancervino4

AI & ML interests

None yet

Recent Activity

updated a collection about 2 months ago
ML Theory
updated a collection about 2 months ago
ML Theory
liked a dataset about 1 year ago
ShapeNet/ShapeNetCore
View all activity

Organizations

None yet

Collections 2

ML Theory
  • The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

    Paper • 2509.26507 • Published Sep 30 • 532
  • Muon Outperforms Adam in Tail-End Associative Memory Learning

    Paper • 2509.26030 • Published Sep 30 • 19
  • Why Language Models Hallucinate

    Paper • 2509.04664 • Published Sep 4 • 192
Tokens
  • Is There a Case for Conversation Optimized Tokenizers in Large Language Models?

    Paper • 2506.18674 • Published Jun 23 • 8
ML Theory
  • The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

    Paper • 2509.26507 • Published Sep 30 • 532
  • Muon Outperforms Adam in Tail-End Associative Memory Learning

    Paper • 2509.26030 • Published Sep 30 • 19
  • Why Language Models Hallucinate

    Paper • 2509.04664 • Published Sep 4 • 192
Tokens
  • Is There a Case for Conversation Optimized Tokenizers in Large Language Models?

    Paper • 2506.18674 • Published Jun 23 • 8

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs