-
LLM Agent Operating System
Paper • 2403.16971 • Published • 72 -
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
Paper • 2508.05629 • Published • 178 -
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens
Paper • 2508.01191 • Published • 236 -
A Survey of Context Engineering for Large Language Models
Paper • 2507.13334 • Published • 258
Collections
Discover the best community collections!
Collections including paper arxiv:2507.19849
-
LLM Agent Operating System
Paper • 2403.16971 • Published • 72 -
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
Paper • 2508.05629 • Published • 178 -
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens
Paper • 2508.01191 • Published • 236 -
A Survey of Context Engineering for Large Language Models
Paper • 2507.13334 • Published • 258