oguzhanercan's Collections: Generation Quality Enhancement
VMix: Improving Text-to-Image Diffusion Model with Cross-Attention Mixing Control
Paper
• 2412.20800
• Published
• 11
Padding Tone: A Mechanistic Analysis of Padding Tokens in T2I Models
Paper
• 2501.06751
• Published
• 32
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps
Paper
• 2501.09732
• Published
• 72
Learnings from Scaling Visual Tokenizers for Reconstruction and Generation
Paper
• 2501.09755
• Published
• 35
Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening
Paper
• 2502.12146
• Published
• 16
PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity
Paper
• 2503.07677
• Published
• 86
CFG-Zero*: Improved Classifier-Free Guidance for Flow Matching Models
Paper
• 2503.18886
• Published
• 24
Alchemist: Turning Public Text-to-Image Data into Generative Gold
Paper
• 2505.19297
• Published
• 84
Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers
Paper
• 2506.07986
• Published
• 19
Ambient Diffusion Omni: Training Good Models with Bad Data
Paper
• 2506.10038
• Published
• 9
Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning
Paper
• 2508.20751
• Published
• 89
Image Tokenizer Needs Post-Training
Paper
• 2509.12474
• Published
• 8
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models
Paper
• 2511.10629
• Published
• 127
The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation
Paper
• 2511.20256
• Published
• 28