trl-internal-testing/tiny-DeepseekV3ForCausalLM Text Generation • 5.52M • Updated 8 days ago • 509 • 3
TorpedoSoftware/Luau-Devstral-24B-Instruct-v0.1 Text Generation • 24B • Updated 21 days ago • 1.32k • 3
HectorHe/Deepseek-V2-13B-Math7K-Expert-Enhance-Subset-Expert-MoE-32-experts Text Generation • 16B • Updated Aug 18 • 13 • 1
mradermacher/Deepseek-V2-13B-Math7K-Expert-Enhance-Subset-Expert-MoE-32-experts-GGUF 16B • Updated Aug 19 • 231 • 1
mradermacher/Deepseek-V2-13B-Math7K-Expert-Enhance-Subset-Expert-MoE-32-experts-i1-GGUF 16B • Updated 13 days ago • 8.17k • 1