google/siglip2-base-patch16-512 Zero-Shot Image Classification • 0.4B • Updated Feb 21, 2025 • 73.7k • 34
Whisper Large V3 🤫 • Running on Zero • MCP • Featured • 809 • Transcribe speech from audio or YouTube videos into text