-
ATLAS: Adaptive Transfer Scaling Laws for Multilingual Pretraining, Finetuning, and Decoding the Curse of Multilinguality
Paper • 2510.22037 • Published • 18 -
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 483 -
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain
Paper • 2509.26507 • Published • 531 -
Scaling Language-Centric Omnimodal Representation Learning
Paper • 2510.11693 • Published • 97
Collections
Discover the best community collections!
Collections including paper arxiv:2510.03279
-
MorphoBench: A Benchmark with Difficulty Adaptive to Model Reasoning
Paper • 2510.14265 • Published • 19 -
DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning
Paper • 2510.15110 • Published • 15 -
MemMamba: Rethinking Memory Patterns in State Space Model
Paper • 2510.03279 • Published • 72 -
Learning Optimal Predictive Checklists
Paper • 2112.01020 • Published • 1
-
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 262 -
MemMamba: Rethinking Memory Patterns in State Space Model
Paper • 2510.03279 • Published • 72 -
From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning
Paper • 2509.23768 • Published • 48 -
LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions
Paper • 2510.08211 • Published • 22
-
HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video
Paper • 2510.05560 • Published • 7 -
TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning
Paper • 2510.06217 • Published • 62 -
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 483 -
Fast-dLLM v2: Efficient Block-Diffusion LLM
Paper • 2509.26328 • Published • 52
-
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 483 -
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs
Paper • 2510.07499 • Published • 48 -
Improving Context Fidelity via Native Retrieval-Augmented Reasoning
Paper • 2509.13683 • Published • 8 -
Multimodal Iterative RAG for Knowledge-Intensive Visual Question Answering
Paper • 2509.00798 • Published • 1
-
LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised Learning in Open-World Scenarios
Paper • 2509.09926 • Published • 13 -
What Breaks Knowledge Graph based RAG? Empirical Insights into Reasoning under Incomplete Knowledge
Paper • 2508.08344 • Published -
MemMamba: Rethinking Memory Patterns in State Space Model
Paper • 2510.03279 • Published • 72 -
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs
Paper • 2510.07499 • Published • 48
-
Intern-S1: A Scientific Multimodal Foundation Model
Paper • 2508.15763 • Published • 256 -
MemMamba: Rethinking Memory Patterns in State Space Model
Paper • 2510.03279 • Published • 72 -
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
Paper • 2510.05034 • Published • 46 -
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs
Paper • 2510.11696 • Published • 173
-
ATLAS: Adaptive Transfer Scaling Laws for Multilingual Pretraining, Finetuning, and Decoding the Curse of Multilinguality
Paper • 2510.22037 • Published • 18 -
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 483 -
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain
Paper • 2509.26507 • Published • 531 -
Scaling Language-Centric Omnimodal Representation Learning
Paper • 2510.11693 • Published • 97
-
MorphoBench: A Benchmark with Difficulty Adaptive to Model Reasoning
Paper • 2510.14265 • Published • 19 -
DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning
Paper • 2510.15110 • Published • 15 -
MemMamba: Rethinking Memory Patterns in State Space Model
Paper • 2510.03279 • Published • 72 -
Learning Optimal Predictive Checklists
Paper • 2112.01020 • Published • 1
-
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 483 -
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs
Paper • 2510.07499 • Published • 48 -
Improving Context Fidelity via Native Retrieval-Augmented Reasoning
Paper • 2509.13683 • Published • 8 -
Multimodal Iterative RAG for Knowledge-Intensive Visual Question Answering
Paper • 2509.00798 • Published • 1
-
Agent Learning via Early Experience
Paper • 2510.08558 • Published • 262 -
MemMamba: Rethinking Memory Patterns in State Space Model
Paper • 2510.03279 • Published • 72 -
From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning
Paper • 2509.23768 • Published • 48 -
LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions
Paper • 2510.08211 • Published • 22
-
HoloScene: Simulation-Ready Interactive 3D Worlds from a Single Video
Paper • 2510.05560 • Published • 7 -
TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning
Paper • 2510.06217 • Published • 62 -
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 483 -
Fast-dLLM v2: Efficient Block-Diffusion LLM
Paper • 2509.26328 • Published • 52
-
LoFT: Parameter-Efficient Fine-Tuning for Long-tailed Semi-Supervised Learning in Open-World Scenarios
Paper • 2509.09926 • Published • 13 -
What Breaks Knowledge Graph based RAG? Empirical Insights into Reasoning under Incomplete Knowledge
Paper • 2508.08344 • Published -
MemMamba: Rethinking Memory Patterns in State Space Model
Paper • 2510.03279 • Published • 72 -
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs
Paper • 2510.07499 • Published • 48
-
Intern-S1: A Scientific Multimodal Foundation Model
Paper • 2508.15763 • Published • 256 -
MemMamba: Rethinking Memory Patterns in State Space Model
Paper • 2510.03279 • Published • 72 -
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
Paper • 2510.05034 • Published • 46 -
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs
Paper • 2510.11696 • Published • 173