Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

sumitdotml
/
moe-emergence

Text Generation
Transformers
Safetensors
English
mixture-of-experts
gpt2
research
expert-specialization
Model card Files Files and versions
xet
Community
moe-emergence
  • 2 contributors
History: 18 commits
sumit
updated model card with ablation results and all 4 runs
4049aa7 9 days ago
  • dense-baseline
    add dense and moe checkpoints 10 days ago
  • moe-main
    add dense and moe checkpoints 10 days ago
  • no-lb-ablation
    Upload no-lb-ablation/ckpt-step-500.pt with huggingface_hub 9 days ago
  • top2-main-10k
    Upload top2-main-10k/ckpt-step-9999.pt with huggingface_hub 9 days ago
  • .gitattributes
    1.52 kB
    add dense and moe checkpoints 10 days ago
  • README.md
    7.09 kB
    updated model card with ablation results and all 4 runs 9 days ago