zhehuderek/textual_decisionmaking_data
Viewer
•
Updated
•
11k
•
9
•
1
VLM with textual-driven GRPO training for vision-grounded decision making (https://arxiv.org/pdf/2503.16965, NeurIPS 2025)
Note This is the textual synthetic data we used for model training.
Note This is the model checkpoint after cold-start math training using GEOQA-8K dataset.
Note This is the model checkpoint after cold-start math training using GEOQA-8K dataset.