MeViS Collection MeViS: A Multi-Modal Dataset for Referring Motion Expression Video Segmentation ⢠2 items ⢠Updated 5 days ago
OmniAVS Collection Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation ⢠3 items ⢠Updated Sep 28