Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 655
Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark Paper • 2311.09122 • Published Nov 15, 2023 • 8
view article Article Introducing EuroBERT: A High-Performance Multilingual Encoder Model Mar 10 • 146
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5 • 294
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated May 5 • 238
view article Article Docmatix - a huge dataset for Document Visual Question Answering Jul 18, 2024 • 78
view article Article Ethics and Society Newsletter #6: Building Better AI: The Importance of Data Quality Jun 24, 2024 • 34
Leaderboards and benchmarks ✨ Collection Cool leaderboard spaces collection for models across modalities! Text, vision, audio, ... • 91 items • Updated Feb 28 • 114