Agent-Omit: Training Efficient LLM Agents for Adaptive Thought and Observation Omission via Agentic Reinforcement Learning Paper • 2602.04284 • Published 3 days ago • 12
Not All Thoughts are Generated Equal: Efficient LLM Reasoning via Multi-Turn Reinforcement Learning Paper • 2505.11827 • Published May 17, 2025 • 1