zhehuderek/textual_decisionmaking_data
VLM with textual-driven GRPO training for vision-grounded decision making (https://arxiv.org/pdf/2503.16965, NeurIPS 2025)
Note: This is the synthetic textual data we used for model training.
Note: This is the model checkpoint after cold-start math training on the GEOQA-8K dataset.