LLaMA-MoE

https://github.com/pjlab-sys4nlp/llama-moe

Recent Activity

Xiaoye08 submitted a paper about 19 hours ago
LatentMem: Customizing Latent Memory for Multi-Agent Systems
Spico authored a paper about 1 month ago
LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training
Spico authored a paper about 1 month ago
Iterative Value Function Optimization for Guided Decoding

Members: Tong Zhu, Xiaoye Qu, Jiacheng Ruan, Daize Dong, tongjingqi(SII), Xuyang Hu

llama-moe's models (8)

llama-moe/LLaMA-MoE-v2-3_8B-residual-sft

8B • Updated Dec 3, 2024 • 37 • 2

llama-moe/LLaMA-MoE-v2-3_8B-2_8-sft

8B • Updated Dec 3, 2024 • 469 • 4

llama-moe/LLaMA-MoE-v1-3_0B-2_16

Text Generation • Updated Jun 25, 2024 • 41 • 11

llama-moe/LLaMA-MoE-v1-3_5B-4_16

Text Generation • Updated Jun 25, 2024 • 69 • 16

llama-moe/LLaMA-MoE-v1-3_0B-2_16-sft

Text Generation • 7B • Updated Jun 25, 2024 • 1 • 2

llama-moe/LLaMA-MoE-v1-3_5B-2_8-sft

Text Generation • 7B • Updated Jun 25, 2024 • 1 • 3

llama-moe/LLaMA-MoE-v1-3_5B-4_16-sft

Text Generation • 7B • Updated Jun 25, 2024 • 2 • 1

llama-moe/LLaMA-MoE-v1-3_5B-2_8

Text Generation • Updated Jun 25, 2024 • 181 • 15