Running on Zero Featured 97 SAM3 Video Segmentation 🐠 97 Track and label objects in videos using text prompts or clicks
YOLO-World: Real-Time Open-Vocabulary Object Detection Paper • 2401.17270 • Published Jan 30, 2024 • 42
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 20 days ago • 274k • 1.55k