Xinyu Zhu's picture

11 10

Xinyu Zhu

TianHongZXY

·

https://zhuxinyu.top

AI & ML interests

Large Language Models; Reasoning; Reinforcement Learning

Recent Activity

published a model 5 days ago

TianHongZXY/Qwen3-4B-Thinking-2507-SFT-10-epochs-synthesized-clear-problems-global_step_280

updated a model 5 days ago

TianHongZXY/Qwen3-4B-Thinking-2507-SFT-10-epochs-synthesized-clear-problems-global_step_280

upvoted a paper about 1 month ago

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

View all activity

Organizations

authored a paper about 1 month ago

RAST: Reasoning Activation in LLMs via Small-model Transfer

Paper • 2506.15710 • Published May 30

authored a paper 5 months ago

The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning

Paper • 2506.01347 • Published Jun 2 • 3

authored 3 papers about 1 year ago

ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models

Paper • 2406.20015 • Published Jun 28, 2024 • 1

A Survey on the Honesty of Large Language Models

Paper • 2409.18786 • Published Sep 27, 2024 • 32

HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing

Paper • 2406.11683 • Published Jun 17, 2024

authored 7 papers over 1 year ago

Zero-Shot Learners for Natural Language Understanding via a Unified Multiple Choice Perspective

Paper • 2210.08590 • Published Oct 16, 2022

Solving Math Word Problems via Cooperative Reasoning induced Language Models

Paper • 2210.16257 • Published Oct 28, 2022

Fengshenbang 1.0: Being the Foundation of Chinese Cognitive Intelligence

Paper • 2209.02970 • Published Sep 7, 2022

Unchosen Experts Can Contribute Too: Unleashing MoE Models' Power by Self-Contrast

Paper • 2405.14507 • Published May 23, 2024

ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation

Paper • 2406.09961 • Published Jun 14, 2024 • 55

Question Answering as Programming for Solving Time-Sensitive Questions

Paper • 2305.14221 • Published May 23, 2023

AutoConv: Automatically Generating Information-seeking Conversations with Large Language Models

Paper • 2308.06507 • Published Aug 12, 2023 • 1