AT^2PO: Agentic Turn-based Policy Optimization via Tree Search Paper • 2601.04767 • Published 28 days ago • 28
Article Bridging the Visual Gap: Fine-Tuning Multimodal Models with Knowledge-Adapted Captions Nov 19, 2024 • 3