Collections
Collections including paper arxiv:2302.05543
- coqui/XTTS-v2
  Text-to-Speech • Updated • 4.96M • 3.17k
- deepseek-ai/DeepSeek-V3-0324
  Text Generation • 685B • Updated • 213k • 3.08k
- openai/whisper-large-v3
  Automatic Speech Recognition • 2B • Updated • 4.18M • 5.1k
- Distilling an End-to-End Voice Assistant Without Instruction Training Data
  Paper • 2410.02678 • Published • 23

- Adding Conditional Control to Text-to-Image Diffusion Models
  Paper • 2302.05543 • Published • 57
- IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models
  Paper • 2308.06721 • Published • 33
- High-Resolution Image Synthesis with Latent Diffusion Models
  Paper • 2112.10752 • Published • 14