ACG: Action Coherence Guidance for Flow-based VLA models Paper • 2510.22201 • Published 21 days ago • 36
DesignLab: Designing Slides Through Iterative Detection and Correction Paper • 2507.17202 • Published Jul 23 • 50
Temporal In-Context Fine-Tuning for Versatile Control of Video Diffusion Models Paper • 2506.00996 • Published Jun 1 • 38
SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation Paper • 2504.14396 • Published Apr 19 • 27
Scaling Up Personalized Aesthetic Assessment via Task Vector Customization Paper • 2407.07176 • Published Jul 9, 2024 • 6
Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model Paper • 2309.03550 • Published Sep 7, 2023 • 12
Learning to Generate Semantic Layouts for Higher Text-Image Correspondence in Text-to-Image Synthesis Paper • 2308.08157 • Published Aug 16, 2023 • 2