Image-Text-to-Text
Transformers
Safetensors
zwcolin commited on
Commit
fed384b
·
verified ·
1 Parent(s): 686282c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -1
README.md CHANGED
@@ -1,4 +1,18 @@
1
  ---
2
  license: apache-2.0
3
  ---
4
- arxiv.org/abs/2601.16973
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
  ---
4
+ ## Citation
5
+
6
+ If you use this model, please cite:
7
+
8
+ ```bibtex
9
+ @misc{wang2026visgymdiversecustomizablescalable,
10
+ title={VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents},
11
+ author={Zirui Wang and Junyi Zhang and Jiaxin Ge and Long Lian and Letian Fu and Lisa Dunlap and Ken Goldberg and XuDong Wang and Ion Stoica and David M. Chan and Sewon Min and Joseph E. Gonzalez},
12
+ year={2026},
13
+ eprint={2601.16973},
14
+ archivePrefix={arXiv},
15
+ primaryClass={cs.CV},
16
+ url={https://arxiv.org/abs/2601.16973},
17
+ }
18
+ ```