gen-audio ChatMusician: Understanding and Generating Music Intrinsically with LLM Paper • 2402.16153 • Published Feb 25, 2024 • 60
ChatMusician: Understanding and Generating Music Intrinsically with LLM Paper • 2402.16153 • Published Feb 25, 2024 • 60
lmm mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding Paper • 2403.12895 • Published Mar 19, 2024 • 32 MiniCPM-V: A GPT-4V Level MLLM on Your Phone Paper • 2408.01800 • Published Aug 3, 2024 • 89 Phantom of Latent for Large Language and Vision Models Paper • 2409.14713 • Published Sep 23, 2024 • 29
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding Paper • 2403.12895 • Published Mar 19, 2024 • 32
Phantom of Latent for Large Language and Vision Models Paper • 2409.14713 • Published Sep 23, 2024 • 29
gen-audio ChatMusician: Understanding and Generating Music Intrinsically with LLM Paper • 2402.16153 • Published Feb 25, 2024 • 60
ChatMusician: Understanding and Generating Music Intrinsically with LLM Paper • 2402.16153 • Published Feb 25, 2024 • 60
lmm mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding Paper • 2403.12895 • Published Mar 19, 2024 • 32 MiniCPM-V: A GPT-4V Level MLLM on Your Phone Paper • 2408.01800 • Published Aug 3, 2024 • 89 Phantom of Latent for Large Language and Vision Models Paper • 2409.14713 • Published Sep 23, 2024 • 29
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding Paper • 2403.12895 • Published Mar 19, 2024 • 32
Phantom of Latent for Large Language and Vision Models Paper • 2409.14713 • Published Sep 23, 2024 • 29