Running on Zero Featured 96 SAM3 Video Segmentation 🐠 96 Track and label objects in videos using text prompts or clicks
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 19 days ago • 274k • 1.55k
Running on Zero MCP Featured 206 ViTPose Transformers ⚡ 206 Detect and estimate human poses in images and videos
Running on Zero Featured 571 Chat with DeepSeek-VL2-small 🌍 571 Generate responses using images and text input
Running on Zero Featured 111 VLM Object Understanding 🦀 111 Explore object detection, visual grounding, keypoint Detecti