AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents Paper • 2407.04363 • Published Jul 5, 2024 • 34
Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning Paper • 2508.19828 • Published Aug 27, 2025 • 8
ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference Paper • 2511.10645 • Published Nov 13, 2025 • 6
Discover and Cure: Concept-aware Mitigation of Spurious Correlation Paper • 2305.00650 • Published May 1, 2023 • 1
P-EAGLE: Parallel-Drafting EAGLE with Scalable Training Paper • 2602.01469 • Published 10 days ago • 1
DFlash: Block Diffusion for Flash Speculative Decoding Paper • 2602.06036 • Published 7 days ago • 40
HiPO Collection Adaptive reasoning LLMs based on the HiPO framework, featuring dynamic “think-on / think-off” control for efficient reasoning. • 2 items • Updated Nov 3, 2025 • 3
InteractScience: Programmatic and Visually-Grounded Evaluation of Interactive Scientific Demonstration Code Generation Paper • 2510.09724 • Published Oct 10, 2025 • 11
Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning Paper • 2510.27606 • Published Oct 31, 2025 • 30
Where Did This Sentence Come From? Tracing Provenance in LLM Reasoning Distillation Paper • 2512.20908 • Published Dec 24, 2025 • 28
Mimicking the Physicist's Eye:A VLM-centric Approach for Physics Formula Discovery Paper • 2508.17380 • Published Aug 24, 2025 • 7
DrugReasoner: Interpretable Drug Approval Prediction with a Reasoning-augmented Language Model Paper • 2508.18579 • Published Aug 26, 2025 • 14
Pruning the Unsurprising: Efficient Code Reasoning via First-Token Surprisal Paper • 2508.05988 • Published Aug 8, 2025 • 21