microsoft/Phi-4-multimodal-instruct
Automatic Speech Recognition • 6B • Updated • 274k • 1.55k
Try on clothes on a person image
Upgraded to v1.0!
Scalable and Versatile 3D Generation from images
Audio Conditioned LipSync with Latent Diffusion Models
View AI model releases for 2024