docling-project/SmolDocling-256M-preview Image-Text-to-Text • 0.3B • Updated Sep 17 • 374k • 1.59k
Qwen/Qwen3-Coder-480B-A35B-Instruct Text Generation • 480B • Updated Aug 21 • 59.4k • • 1.24k
TPTT: Transforming Pretrained Transformer into Titans Paper • 2506.17671 • Published Jun 21 • 5