Collections
Discover the best community collections!
Collections trending this week
-
The Ultra-Scale Playbook
🌌 3.62k • The ultimate guide to training LLMs on large GPU clusters
-
The Smol Training Playbook
📚 2.8k • The secrets to building world-class LLMs
-
Evaluation Guidebook
📝 228 • Display benchmark evaluation data for LLMs
-
FineVision: Open Data is All You Need
📝 215 • A new open-source dataset for training VLMs
-
Falcon H1R Playground
🚀 12 • This is a chat demo with Falcon-H1R reasoning models.
-
tiiuae/Falcon-H1R-7B
Text Generation • 8B • Updated • 1.22k • 49
tiiuae/Falcon-H1R-7B-GGUF
8B • Updated • 1.81k • 18
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling
Paper • 2601.02346 • Published • 9
-
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation • 235B • Updated • 33.7k • 76
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • 235B • Updated • 23.8k • 391
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 519k • 139
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation • 235B • Updated • 106k • 742
-
NC-AI-consortium-VAETKI/VAETKI
Text Generation • 112B • Updated • 2.61k • 41
LGAI-EXAONE/K-EXAONE-236B-A23B
Text Generation • 237B • Updated • 2.06k • 377
naver-hyperclovax/HyperCLOVAX-SEED-Think-32B
Text Generation • 33B • Updated • 29.3k • 150
naver-hyperclovax/HyperCLOVAX-SEED-Omni-8B
Text Generation • 11B • Updated • 692 • 126
-
Qwen3 VL Demo
😻 340 • Interact with a chatbot that handles text and images
-
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-Text-to-Text • 236B • Updated • 39.2k • 357
Qwen/Qwen3-VL-235B-A22B-Instruct
Image-Text-to-Text • 236B • Updated • 222k • 349
Qwen/Qwen3-VL-235B-A22B-Thinking-FP8
Image-Text-to-Text • 236B • Updated • 5.23k • 24