Running 6 Dolphin: Efficient Audio-Visual Speech Separation with Discrete Lip Semantics and Multi-Scale Global-Local Attention 👀 Separate speakers in videos
Running on Zero MCP 106 TIGER Audio Extractor ✂ Extraction & Reconstruction for Efficient Speech Separation