10 Open Challenges Steering the Future of Vision-Language-Action Models Paper • 2511.05936 • Published 4 days ago • 3
DigiData: Training and Evaluating General-Purpose Mobile Control Agents Paper • 2511.07413 • Published 1 day ago • 3
RLoop: An Self-Improving Framework for Reinforcement Learning with Iterative Policy Initialization Paper • 2511.04285 • Published 5 days ago • 4
SofT-GRPO: Surpassing Discrete-Token LLM Reinforcement Learning via Gumbel-Reparameterized Soft-Thinking Policy Optimization Paper • 2511.06411 • Published 2 days ago • 13
RedOne 2.0: Rethinking Domain-specific LLM Post-Training in Social Networking Services Paper • 2511.07070 • Published 1 day ago • 14
IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction Paper • 2511.07327 • Published 1 day ago • 57
HaluMem: Evaluating Hallucinations in Memory Systems of Agents Paper • 2511.03506 • Published 6 days ago • 70
EVTAR: End-to-End Try on with Additional Unpaired Visual Reference Paper • 2511.00956 • Published 9 days ago • 4
SAIL-RL: Guiding MLLMs in When and How to Think via Dual-Reward RL Tuning Paper • 2511.02280 • Published 8 days ago • 2
SIMS-V: Simulated Instruction-Tuning for Spatial Video Understanding Paper • 2511.04668 • Published 5 days ago • 4
Too Good to be Bad: On the Failure of LLMs to Role-Play Villains Paper • 2511.04962 • Published 5 days ago • 44
Towards Mitigating Hallucinations in Large Vision-Language Models by Refining Textual Embeddings Paper • 2511.05017 • Published 5 days ago • 5
TabTune: A Unified Library for Inference and Fine-Tuning Tabular Foundation Models Paper • 2511.02802 • Published 7 days ago • 12
Orion-MSP: Multi-Scale Sparse Attention for Tabular In-Context Learning Paper • 2511.02818 • Published 7 days ago • 13
UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions Paper • 2511.03334 • Published 6 days ago • 48
Learning Vision-Driven Reactive Soccer Skills for Humanoid Robots Paper • 2511.03996 • Published 6 days ago • 3