EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling Paper ⢠2509.23909 ⢠Published Sep 28 ⢠31
OmniGen2: Exploration to Advanced Multimodal Generation Paper ⢠2506.18871 ⢠Published Jun 23 ⢠77
Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces Paper ⢠2506.00123 ⢠Published May 30 ⢠35
SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis Paper ⢠2307.01952 ⢠Published Jul 4, 2023 ⢠90
ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning Paper ⢠2501.04698 ⢠Published Jan 8 ⢠15
The GAN is dead; long live the GAN! A Modern GAN Baseline Paper ⢠2501.05441 ⢠Published Jan 9 ⢠95
view article Article StableV2V: Stablizing Shape Consistency in Video-to-Video Editing Nov 19, 2024 ⢠2
StableV2V: Stablizing Shape Consistency in Video-to-Video Editing Paper ⢠2411.11045 ⢠Published Nov 17, 2024 ⢠11
iDesigner: A High-Resolution and Complex-Prompt Following Text-to-Image Diffusion Model for Interior Design Paper ⢠2312.04326 ⢠Published Dec 7, 2023 ⢠3
A Systematic Review of Deep Learning-based Research on Radiology Report Generation Paper ⢠2311.14199 ⢠Published Nov 23, 2023 ⢠2
LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis Paper ⢠2305.11520 ⢠Published May 19, 2023 ⢠1
Towards Interactive Image Inpainting via Sketch Refinement Paper ⢠2306.00407 ⢠Published Jun 1, 2023 ⢠2