CalliReader: Contextualizing Chinese Calligraphy via an Embedding-Aligned Vision-Language Model Paper • 2503.06472 • Published Mar 9 • 8
MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning Paper • 2506.10963 • Published Jun 12 • 9