WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model Paper • 2308.15962 • Published Aug 30, 2023
Unified Lexical Representation for Interpretable Visual-Language Alignment Paper • 2407.17827 • Published Jul 25, 2024 • 1
ReVSeg: Incentivizing the Reasoning Chain for Video Segmentation with Reinforcement Learning Paper • 2512.02835 • Published 14 days ago • 9