Models Qwen/Qwen2.5-Omni-7B Any-to-Any • 11B • Updated Apr 30 • 182k • 1.81k deepseek-ai/DeepSeek-R1 Text Generation • 685B • Updated Mar 27 • 408k • • 12.8k upstage/TinySolar-248m-4k Text Generation • 0.2B • Updated Feb 7, 2024 • 476 • 8 upstage/TinySolar-248m-4k-code-instruct Text Generation • 0.2B • Updated Apr 19, 2024 • 85 • 8
datasets HuggingFaceH4/llava-instruct-mix-vsft Viewer • Updated Apr 11, 2024 • 273k • 2.3k • 47 togethercomputer/RedPajama-Data-1T Viewer • Updated Jun 17, 2024 • 1.73M • 1.18k • 1.11k
Models Qwen/Qwen2.5-Omni-7B Any-to-Any • 11B • Updated Apr 30 • 182k • 1.81k deepseek-ai/DeepSeek-R1 Text Generation • 685B • Updated Mar 27 • 408k • • 12.8k upstage/TinySolar-248m-4k Text Generation • 0.2B • Updated Feb 7, 2024 • 476 • 8 upstage/TinySolar-248m-4k-code-instruct Text Generation • 0.2B • Updated Apr 19, 2024 • 85 • 8
datasets HuggingFaceH4/llava-instruct-mix-vsft Viewer • Updated Apr 11, 2024 • 273k • 2.3k • 47 togethercomputer/RedPajama-Data-1T Viewer • Updated Jun 17, 2024 • 1.73M • 1.18k • 1.11k