arXiv:2509.24897
Yuran Wang
Ryann829
AI & ML interests
Multimodal Large Language Model
Recent Activity
authored a paper 18 days ago
Ocean-OCR: Towards General OCR Application via a Vision-Language Model
authored a paper 18 days ago
DualToken: Towards Unifying Visual Understanding and Generation with Dual Visual Vocabularies