This is an official checkpoint from the paper "Linear Model Merging Unlocks Simple and Scalable Multimodal Data Mixture Optimization" (link). See the official implementation for more information on how to use the models.
This repo contains versions of Qwen/Qwen2-VL-2B fine-tuned on diverse mixtures of Chart, Counting, GeneralVQA, and OCR data (~100k samples).
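The paper title refers to linear model merging. As a rough illustration only, the sketch below assumes this means a weighted average of the parameters of checkpoints that share one architecture; the official implementation remains the reference for the actual procedure. The helper `linear_merge`, the toy parameter names, and the mixture weights are all hypothetical, and plain Python lists stand in for real tensors.

```python
from typing import Dict, List, Sequence

# Toy stand-in for a real state dict: parameter name -> flattened weights.
StateDict = Dict[str, List[float]]


def linear_merge(state_dicts: Sequence[StateDict], weights: Sequence[float]) -> StateDict:
    """Return the weighted average of several checkpoints' parameters.

    Hypothetical helper, not the paper's code. Weights are normalized to
    sum to 1, and all checkpoints must share parameter names and shapes.
    """
    if not state_dicts or len(state_dicts) != len(weights):
        raise ValueError("need at least one checkpoint and one weight per checkpoint")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    norm = [w / total for w in weights]

    keys = state_dicts[0].keys()
    if any(sd.keys() != keys for sd in state_dicts):
        raise ValueError("checkpoints do not share the same parameter names")

    merged: StateDict = {}
    for key in keys:
        params = [sd[key] for sd in state_dicts]
        size = len(params[0])
        if any(len(p) != size for p in params):
            raise ValueError(f"shape mismatch for parameter {key!r}")
        # Element-wise weighted sum across checkpoints.
        merged[key] = [sum(w * p[i] for w, p in zip(norm, params)) for i in range(size)]
    return merged


# Two toy "experts", e.g. one tuned on Chart data and one on OCR data.
chart_expert = {"layer.weight": [1.0, 2.0], "layer.bias": [0.0]}
ocr_expert = {"layer.weight": [3.0, 6.0], "layer.bias": [4.0]}
merged = linear_merge([chart_expert, ocr_expert], weights=[0.75, 0.25])
```

With real checkpoints the same loop would run over tensors loaded from each fine-tuned model, and the weights would play the role of the data-mixture proportions being explored.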
The following hyperparameters were used during training:
Base model
Qwen/Qwen2-VL-2B