9 22 10

Ziqi Huang

Ziqi

https://ziqihuangg.github.io/

AI & ML interests

Computer Vision, Generative Model, Image Generation, Video Generation, World Model

Recent Activity

upvoted a paper 6 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

upvoted a paper 11 days ago

Exploring MLLM-Diffusion Information Transfer with MetaCanvas

liked a Space 17 days ago

worldbench/WorldLens

View all activity

Organizations

upvoted a paper 6 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 7 days ago • 61

upvoted a paper 11 days ago

Exploring MLLM-Diffusion Information Transfer with MetaCanvas

Paper • 2512.11464 • Published 17 days ago • 12

upvoted 2 papers about 1 month ago

PhysX-Anything: Simulation-Ready Physical 3D Assets from Single Image

Paper • 2511.13648 • Published Nov 17 • 52

Simulating the Visual World with Artificial Intelligence: A Roadmap

Paper • 2511.08585 • Published Nov 11 • 29

upvoted a paper about 2 months ago

The Quest for Generalizable Motion Generation: Data, Model, and Evaluation

Paper • 2510.26794 • Published Oct 30 • 26

upvoted 2 papers 2 months ago

RealDPO: Real or Not Real, that is the Preference

Paper • 2510.14955 • Published Oct 16 • 6

Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark

Paper • 2510.13759 • Published Oct 15 • 9

upvoted 2 papers 3 months ago

VChain: Chain-of-Visual-Thought for Reasoning in Video Generation

Paper • 2510.05094 • Published Oct 6 • 37

Stencil: Subject-Driven Generation with Context Guidance

Paper • 2509.17120 • Published Sep 21 • 6

upvoted a paper 4 months ago

CineScale: Free Lunch in High-Resolution Cinematic Visual Generation

Paper • 2508.15774 • Published Aug 21 • 20

upvoted a paper 5 months ago

Cut2Next: Generating Next Shot via In-Context Tuning

Paper • 2508.08244 • Published Aug 11 • 13

upvoted 2 papers 6 months ago

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Paper • 2506.21356 • Published Jun 26 • 22

Ego-R1: Chain-of-Tool-Thought for Ultra-Long Egocentric Video Reasoning

Paper • 2506.13654 • Published Jun 16 • 43

upvoted 2 papers 9 months ago

VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness

Paper • 2503.21755 • Published Mar 27 • 33

CFG-Zero*: Improved Classifier-Free Guidance for Flow Matching Models

Paper • 2503.18886 • Published Mar 24 • 24

upvoted a paper 11 months ago

RepVideo: Rethinking Cross-Layer Representation for Video Generation

Paper • 2501.08994 • Published Jan 15 • 15

upvoted 2 papers about 1 year ago

Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models

Paper • 2412.09645 • Published Dec 10, 2024 • 36

VBench++: Comprehensive and Versatile Benchmark Suite for Video Generative Models

Paper • 2411.13503 • Published Nov 20, 2024 • 34

upvoted 2 papers about 2 years ago

FreeInit: Bridging Initialization Gap in Video Diffusion Models

Paper • 2312.07537 • Published Dec 12, 2023 • 27

VBench: Comprehensive Benchmark Suite for Video Generative Models

Paper • 2311.17982 • Published Nov 29, 2023 • 9

Ziqi Huang

AI & ML interests

Recent Activity

Organizations

Ziqi's activity