-
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper • 2402.17485 • Published • 195 -
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding
Paper • 2403.15377 • Published • 26 -
nvidia/parakeet-tdt-0.6b-v2
Automatic Speech Recognition • Updated • 3.67M • 1.37k -
google/gemma-3n-E4B-it-litert-preview
Image-Text-to-Text • Updated • 1.48k
mm
W28
·
AI & ML interests
None yet
Organizations
None yet
Ai
-
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper • 2402.17485 • Published • 195 -
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding
Paper • 2403.15377 • Published • 26 -
nvidia/parakeet-tdt-0.6b-v2
Automatic Speech Recognition • Updated • 3.67M • 1.37k -
google/gemma-3n-E4B-it-litert-preview
Image-Text-to-Text • Updated • 1.48k
models
0
None public yet
datasets
0
None public yet