Jie Chen's picture

15 8 2

Jie Chen

survivi

·

survivi

AI & ML interests

Large Language Model, Natural Language Processing

Recent Activity

authored a paper 1 day ago

From Trial-and-Error to Improvement: A Systematic Analysis of LLM Exploration Mechanisms in RLVR

authored a paper 1 day ago

Unveiling the Flaws: Exploring Imperfections in Synthetic Data and Mitigation Strategies for Large Language Models

authored a paper 1 day ago

Sticker-TTS: Learn to Utilize Historical Experience with a Sticker-driven Test-Time Scaling Framework

View all activity

Organizations

authored 4 papers 1 day ago

From Trial-and-Error to Improvement: A Systematic Analysis of LLM Exploration Mechanisms in RLVR

Paper • 2508.07534 • Published Aug 11, 2025 • 1

Unveiling the Flaws: Exploring Imperfections in Synthetic Data and Mitigation Strategies for Large Language Models

Paper • 2406.12397 • Published Jun 18, 2024

Sticker-TTS: Learn to Utilize Historical Experience with a Sticker-driven Test-Time Scaling Framework

Paper • 2509.05007 • Published Sep 5, 2025

Decomposing the Entropy-Performance Exchange: The Missing Keys to Unlocking Effective Reinforcement Learning

Paper • 2508.02260 • Published Aug 4, 2025

authored 8 papers 11 months ago

The Dawn After the Dark: An Empirical Study on Factuality Hallucination in Large Language Models

Paper • 2401.03205 • Published Jan 6, 2024

LLMBox: A Comprehensive Library for Large Language Models

Paper • 2407.05563 • Published Jul 8, 2024

Towards Effective and Efficient Continual Pre-training of Large Language Models

Paper • 2407.18743 • Published Jul 26, 2024

Technical Report: Enhancing LLM Reasoning with Reward-guided Tree Search

Paper • 2411.11694 • Published Nov 18, 2024

Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems

Paper • 2412.09413 • Published Dec 12, 2024 • 1

YuLan-Mini: An Open Data-efficient Language Model

Paper • 2412.17743 • Published Dec 23, 2024 • 66

An Empirical Study on Eliciting and Improving R1-like Reasoning Models

Paper • 2503.04548 • Published Mar 6, 2025 • 9

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Paper • 2503.05592 • Published Mar 7, 2025 • 27