|
|
--- |
|
|
license: apache-2.0 |
|
|
language: |
|
|
- en |
|
|
- zh |
|
|
pipeline_tag: text-to-image |
|
|
library_name: transformers |
|
|
--- |
|
|
|
|
|
**Quantized GGUF Version** |
|
|
|
|
|
**Original Model Link:** [https://huggingface.co/meituan-longcat/LongCat-Image](https://huggingface.co/meituan-longcat/LongCat-Image) |
|
|
|
|
|
**Watch us at Youtube:** [@VantageWithAI](https://www.youtube.com/@vantagewithai) |
|
|
|
|
|
<div align="center"> |
|
|
<img src="https://huggingface.co/meituan-longcat/LongCat-Image/resolve/main/assets/longcat-image_logo.svg" width="45%" alt="LongCat-Image" /> |
|
|
</div> |
|
|
<hr> |
|
|
|
|
|
|
|
|
|
|
|
<div align="center" style="line-height: 1;"> |
|
|
<a href='https://github.com/meituan-longcat/LongCat-Image/blob/main/assets/LongCat_Image_Technical_Report.pdf'><img src='https://img.shields.io/badge/Technical-Report-red'></a> |
|
|
<a href='https://github.com/meituan-longcat/LongCat-Image'><img src='https://img.shields.io/badge/GitHub-Code-black'></a> |
|
|
<a href='https://github.com/meituan-longcat/LongCat-Flash-Chat/blob/main/figures/wechat_official_accounts.png'><img src='https://img.shields.io/badge/WeChat-LongCat-brightgreen?logo=wechat&logoColor=white'></a> |
|
|
<a href='https://x.com/Meituan_LongCat'><img src='https://img.shields.io/badge/Twitter-LongCat-white?logo=x&logoColor=white'></a> |
|
|
</div> |
|
|
|
|
|
<div align="center" style="line-height: 1;"> |
|
|
|
|
|
[//]: # ( <a href='https://meituan-longcat.github.io/LongCat-Image/'><img src='https://img.shields.io/badge/Project-Page-green'></a>) |
|
|
<a href='https://huggingface.co/meituan-longcat/LongCat-Image'><img src='https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-LongCat--Image-blue'></a> |
|
|
<a href='https://huggingface.co/meituan-longcat/LongCat-Image-Dev'><img src='https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-LongCat--Image--Dev-blue'></a> |
|
|
<a href='https://huggingface.co/meituan-longcat/LongCat-Image-Edit'><img src='https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-LongCat--Image--Edit-blue'></a> |
|
|
</div> |
|
|
|
|
|
|
|
|
|
|
|
## Introduction |
|
|
We introduce **LongCat-Image**, a pioneering open-source and bilingual (Chinese-English) foundation model for image generation, designed to address core challenges in multilingual text rendering, photorealism, deployment efficiency, and developer accessibility prevalent in current leading models. |
|
|
<div align="center"> |
|
|
<img src="https://huggingface.co/meituan-longcat/LongCat-Image/resolve/main/assets/model_struct.jpg" width="90%" alt="LongCat-Image Generation Examples" /> |
|
|
</div> |
|
|
|
|
|
|
|
|
### Key Features |
|
|
- π **Exceptional Efficiency and Performance**: With only **6B parameters**, LongCat-Image surpasses numerous open-source models that are several times larger across multiple benchmarks, demonstrating the immense potential of efficient model design. |
|
|
- π **Powerful Chinese Text Rendering**: LongCat-Image demonstrates superior accuracy and stability in rendering common Chinese characters compared to existing SOTA open-source models and achieves industry-leading coverage of the Chinese dictionary. |
|
|
- π **Remarkable Photorealism**: Through an innovative data strategy and training framework, LongCat-Image achieves remarkable photorealism in generated images. |
|
|
|
|
|
[//]: # (For more details, please refer to the comprehensive [***LongCat-Image Technical Report***](https://arxiv.org/abs/2412.11963).) |
|
|
|
|
|
## π¨ Showcase |
|
|
|
|
|
<div align="center"> |
|
|
<img src="https://huggingface.co/meituan-longcat/LongCat-Image/resolve/main/assets/gallery.jpeg" width="90%" alt="LongCat-Image Generation Examples" /> |
|
|
</div> |
|
|
|