3 15 2

ChengpengLi

AI & ML interests

LLM for Reasoning, reinforcement learning, recommendation system, diffusion models

Recent Activity

upvoted a paper about 2 months ago

Qwen3-VL Technical Report

upvoted a paper about 2 months ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

upvoted a paper 4 months ago

Agentic Entropy-Balanced Policy Optimization

View all activity

Organizations

None yet

upvoted 2 papers about 2 months ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published Nov 26, 2025 • 152

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published Dec 15, 2025 • 64

upvoted 2 papers 4 months ago

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published Oct 16, 2025 • 106

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26, 2025 • 118

upvoted 2 papers 6 months ago

We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning

Paper • 2508.10433 • Published Aug 14, 2025 • 144

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26, 2025 • 158

upvoted a paper 9 months ago

Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning

Paper • 2505.16410 • Published May 22, 2025 • 58

upvoted a paper 11 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6, 2025 • 113

upvoted 3 papers about 1 year ago

upvoted 2 collections over 1 year ago

Qwen2.5-Math

Collection

Math-specific model series based on Qwen2.5 • 11 items • Updated Dec 31, 2025 • 89

Qwen2-Math

Collection

Math-specific model series based on Qwen2 • 8 items • Updated Dec 31, 2025 • 52

upvoted 2 papers over 1 year ago

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

Paper • 2406.13542 • Published Jun 19, 2024 • 17

DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Paper • 2407.04078 • Published Jul 4, 2024 • 21

ChengpengLi

AI & ML interests

Recent Activity

Organizations

ChengpengLi's activity