Jiazheng Xu's picture

3 7 20

Jiazheng Xu

xujz0703

·

AI & ML interests

None yet

Recent Activity

upvoted an article 8 days ago

Why Did MiniMax M2 End Up as a Full Attention Model?

upvoted a paper 21 days ago

Glyph: Scaling Context Windows via Visual-Text Compression

upvoted an article 2 months ago

From GRPO to DAPO and GSPO: What, Why, and How

View all activity

Organizations

upvoted an article 8 days ago

Article

Why Did MiniMax M2 End Up as a Full Attention Model?

By

•

11 days ago

• 52

upvoted a paper 21 days ago

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published 21 days ago • 66

upvoted an article 2 months ago

Article

From GRPO to DAPO and GSPO: What, Why, and How

By

•

Aug 9

• 52

upvoted a paper 4 months ago

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1 • 237

upvoted 2 papers 8 months ago

GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training

Paper • 2503.08525 • Published Mar 11 • 17

ImageReward: Learning and Evaluating Human Preferences for Text-to-Image Generation

Paper • 2304.05977 • Published Apr 12, 2023 • 3

upvoted a paper 10 months ago

VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Paper • 2412.21059 • Published Dec 30, 2024 • 18