DeepAgent: A General Reasoning Agent with Scalable Toolsets Paper β’ 2510.21618 β’ Published 16 days ago β’ 92
CoSMo: A Multimodal Transformer for Page Stream Segmentation in Comic Books Paper β’ 2507.10053 β’ Published Jul 14 β’ 1
Running 1.16k 1.16k FineWeb: decanting the web for the finest text data at scale π· Generate high-quality text data for LLMs using FineWeb
Running 3.45k 3.45k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
Intern-S1: A Scientific Multimodal Foundation Model Paper β’ 2508.15763 β’ Published Aug 21 β’ 255
Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations Paper β’ 2506.18898 β’ Published Jun 23 β’ 33
Tar Collection [NeurIPS 2025] Unifying Visual Understanding and Generation via Text-Aligned Representations β’ 5 items β’ Updated Sep 20 β’ 16
Open LLM Leaderboard best models β€οΈβπ₯ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: β’ 65 items β’ Updated Mar 20 β’ 648