Running 3.48k 3.48k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
deepset/roberta-base-squad2 Question Answering β’ 0.1B β’ Updated Sep 24, 2024 β’ 687k β’ β’ 927
unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF Text Generation β’ 8B β’ Updated May 10 β’ 34.3k β’ 290