Penalizing Infeasible Actions and Reward Scaling in Reinforcement Learning with Offline Data Paper • 2507.08761 • Published Jul 11 • 1
Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making Paper • 2310.03022 • Published Oct 4, 2023
ReflAct: World-Grounded Decision Making in LLM Agents via Goal-State Reflection Paper • 2505.15182 • Published May 21 • 6
Adaptive $Q$-Aid for Conditional Supervised Learning in Offline Reinforcement Learning Paper • 2402.02017 • Published Feb 3, 2024
LESSON: Learning to Integrate Exploration Strategies for Reinforcement Learning via an Option Framework Paper • 2310.03342 • Published Oct 5, 2023