ChartQwen
Model Description
ChartQwen is a vision-language model fine-tuned from Qwen/Qwen-VL for chart understanding tasks.
The model is designed to interpret visual charts such as bar charts, line graphs, and plots, and answer natural language questions related to them.
It supports multimodal reasoning by jointly processing images and text prompts.
Intended Use
This model can be used for:
- Chart question answering
- Chart data interpretation
- Visual reasoning over plots and graphs
- Document and report analysis involving charts
Training Details
- Base model: Qwen/Qwen-VL
- Modality: Image + Text
- Fine-tuning type: Supervised fine-tuning on chart-related visual-question pairs
- Dataset: Custom chart dataset (generated and curated for chart understanding)
Limitations
- Performance may degrade on low-resolution or highly cluttered charts
- The model may struggle with handwritten charts or uncommon chart styles
- Numerical precision depends on chart clarity
Model tree for Sayeem26s/Chartqwen
Base model
Qwen/Qwen-VL