UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist Paper • 2511.08521 • Published 8 days ago • 36
Black-Box On-Policy Distillation of Large Language Models Paper • 2511.10643 • Published 6 days ago • 39
Depth Anything 3: Recovering the Visual Space from Any Views Paper • 2511.10647 • Published 6 days ago • 67
Music Flamingo: Scaling Music Understanding in Audio Language Models Paper • 2511.10289 • Published 6 days ago • 9