Running on Zero Featured 282 Qwen Image Edit 2509 👀 Featured 282 Generate edited images based on prompts and input images
V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference Optimization Paper • 2411.02712 • Published Nov 5, 2024
Manager: Aggregating Insights from Unimodal Experts in Two-Tower VLMs and MLLMs Paper • 2506.11515 • Published Jun 13
AI4Research: A Survey of Artificial Intelligence for Scientific Research Paper • 2507.01903 • Published Jul 2 • 4
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published Jan 14 • 300
M$^3$CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought Paper • 2405.16473 • Published May 26, 2024
Self-Constructed Context Decompilation with Fined-grained Alignment Enhancement Paper • 2406.17233 • Published Jun 25, 2024