Xinyu Zhu

TianHongZXY

https://zhuxinyu.top

AI & ML interests

Large Language Models; Reasoning; Reinforcement Learning

Recent Activity

published a model 5 days ago

TianHongZXY/Qwen3-4B-Thinking-2507-SFT-10-epochs-synthesized-clear-problems-global_step_280

updated a model 5 days ago

TianHongZXY/Qwen3-4B-Thinking-2507-SFT-10-epochs-synthesized-clear-problems-global_step_280

upvoted a paper about 1 month ago

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30 • 54

upvoted a paper 2 months ago

A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code

Paper • 2508.18106 • Published Aug 25 • 342

upvoted a collection 4 months ago

RLVR-Decomposed

Collection

The collection for the Paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning" • 9 items • Updated Jun 1 • 3

upvoted a collection 5 months ago

AdaDecode

Collection

[ICML 2025] AdaDecode: Accelerating LLM Decoding with Adaptive Layer Parallelism. • 18 items • Updated Jun 4 • 3

upvoted a paper 5 months ago

The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning

Paper • 2506.01347 • Published Jun 2 • 3

upvoted a paper 6 months ago

WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning

Paper • 2505.16421 • Published May 22 • 19

upvoted an article 8 months ago

Open R1: Update #3

Article • and 9 others • Mar 11 • 295

upvoted a collection 8 months ago

Tulu 3 Models

Collection

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated Apr 30 • 103

upvoted an article 9 months ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • Jan 28 • 885

upvoted a paper about 1 year ago

A Survey on the Honesty of Large Language Models

Paper • 2409.18786 • Published Sep 27, 2024 • 32

upvoted a paper over 1 year ago

ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation

Paper • 2406.09961 • Published Jun 14, 2024 • 55
