arxiv:2503.18860
Xiaozhong Ji
xiaozhongji
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
BaseReward: A Strong Baseline for Multimodal Reward Model
updated
a model
2 months ago
xiaozhongji/qwen2_5_vl_7b_mmrl30k_grpo
published
a model
2 months ago
xiaozhongji/qwen2_5_vl_7b_mmrl30k_grpo
Organizations
None yet