File size: 3,531 Bytes

---
license: apache-2.0
datasets:
- MCG-NJU/X-Dance
base_model:
- Wan-AI/Wan2.1-I2V-14B-480P
pipeline_tag: image-to-video
library_name: diffusers
---

<p align="center">

  <h2 align="center">SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation</h2>
  <p align="center">
    <a href="https://scholar.google.com/citations?hl=en&user=0lLB3fsAAAAJ"><strong>Jiaming Zhang</strong></a>
    ·
    <a href="https://dblp.org/pid/316/8117.html"><strong>Shengming Cao</strong></a>
    ·
    <a href="https://qianduoduolr.github.io/"><strong>Rui Li</strong></a>
    ·
    <a href="https://openreview.net/profile?id=~Xiaotong_Zhao1"><strong>Xiaotong Zhao</strong></a>
    ·
    <a href="https://scholar.google.com/citations?user=TSMchWcAAAAJ&hl=en&oi=ao"><strong>Yutao Cui</strong></a>
    <br>
    <a href=""><strong>Xinglin Hou</strong></a>
    ·
    <a href="https://mcg.nju.edu.cn/member/gswu/en/index.html"><strong>Gangshan Wu</strong></a>
    ·
    <a href="https://openreview.net/profile?id=~Haolan_Chen1"><strong>Haolan Chen</strong></a>
    ·
    <a href="https://scholar.google.com/citations?user=FHvejDIAAAAJ"><strong>Yu Xu</strong></a> 
    ·
    <a href="https://scholar.google.com/citations?user=TSMchWcAAAAJ&hl=en&oi=ao"><strong>Limin Wang</strong></a>
    ·
    <a href="https://openreview.net/profile?id=~Kai_Ma4"><strong>Kai Ma</strong></a>
    <br>
    <br>
        <a href="https://arxiv.org/abs/2511.19320"><img src='https://img.shields.io/badge/arXiv-2511.19320-red' alt='Paper PDF'></a>
        <a href='https://mcg-nju.github.io/steadydancer-web'><img src='https://img.shields.io/badge/Project-Page-blue' alt='Project Page'></a>
        <a href='https://github.com/MCG-NJU/SteadyDancer'><img src='https://img.shields.io/badge/Github-SteadyDancer-orange'></a>
        <a href='https://huggingface.co/MCG-NJU/SteadyDancer-14B'><img src='https://img.shields.io/badge/%F0%9F%A4%97%20HuggingFace-Model-yellow'></a>
        <a href='https://huggingface.co/datasets/MCG-NJU/X-Dance'><img src='https://img.shields.io/badge/%F0%9F%A4%97%20HuggingFace-X--Dance-green'></a>
    <br>
    <b></a>Multimedia Computing Group, Nanjing University &nbsp; | &nbsp; </a>Platform and Content Group (PCG), Tencent  </b>
    <br>
  </p>
</p>

This repository is the `checkpoint` of paper "SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation". SteadyDancer is a strong animation framework based on **Image-to-Video paradigm**, ensuring **robust first-frame preservation**. In contrast to prior *Reference-to-Video* approaches that often suffer from identity drift due to **spatio-temporal misalignments** common in real-world applications, SteadyDancer generates **high-fidelity and temporally coherent** human animations, outperforming existing methods in visual quality and control while **requiring significantly fewer training resources**.

![teaser](assets/teaser.png?raw=true)


## 📚 Citation

If you find our paper or this codebase useful for your research, please cite us.
```BibTeX
@misc{zhang2025steadydancer,
      title={SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation}, 
      author={Jiaming Zhang and Shengming Cao and Rui Li and Xiaotong Zhao and Yutao Cui and Xinglin Hou and Gangshan Wu and Haolan Chen and Yu Xu and Limin Wang and Kai Ma},
      year={2025},
      eprint={2511.19320},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2511.19320}, 
}
```