In-situ Value-aligned Human-Robot Interactions with Physical Constraints • Paper • 2508.07606 • Published Aug 11 • 1
Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning • Paper • 2512.07461 • Published 4 days ago • 68
RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling • Paper • 2506.08672 • Published Jun 10 • 30