QizhiPei

https://qizhipei.github.io/

AI & ML interests

AI4Science, LLM, Data Synthesis

Recent Activity

liked a dataset 5 days ago

OpenDataArena/ODA-Mixture-500k

liked a dataset 5 days ago

OpenDataArena/ODA-Mixture-100k

liked a dataset 5 days ago

OpenDataArena/ODA-Math-460k

upvoted a paper 6 days ago

Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

Paper • 2512.23447 • Published 7 days ago • 92

upvoted a paper 20 days ago

OpenDataArena: A Fair and Open Arena for Benchmarking Post-Training Dataset Value

Paper • 2512.14051 • Published 21 days ago • 40

upvoted a paper about 1 month ago

Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights

Paper • 2512.01816 • Published Dec 1, 2025 • 88

upvoted a collection about 1 month ago

Olmo 3

Artifacts for the Olmo 3 release. • 9 items • Updated 13 days ago • 156

upvoted a paper about 2 months ago

GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models

Paper • 2511.11134 • Published Nov 14, 2025 • 31

upvoted 3 papers 3 months ago

Scaling Code-Assisted Chain-of-Thoughts and Instructions for Model Reasoning

Paper • 2510.04081 • Published Oct 5, 2025 • 23

DeepScientist: Advancing Frontier-Pushing Scientific Findings Progressively

Paper • 2509.26603 • Published Sep 30, 2025 • 16

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26, 2025 • 139

upvoted a collection 3 months ago

ScaleDiff

Data & Models for paper "ScaleDiff: Scaling Difficult Problems for Advanced Mathematical Reasoning" • 6 items • Updated Sep 26, 2025 • 1

upvoted a paper 3 months ago

ScaleDiff: Scaling Difficult Problems for Advanced Mathematical Reasoning

Paper • 2509.21070 • Published Sep 25, 2025 • 9

upvoted a paper 4 months ago

Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning

Paper • 2508.21589 • Published Aug 29, 2025 • 3

upvoted a collection 4 months ago

AceReason

Math and Code reasoning model trained through reinforcement learning (RL) • 7 items • Updated 13 days ago • 20

upvoted 2 papers 4 months ago

3D-MolT5: Towards Unified 3D Molecule-Text Modeling with 3D Molecular Tokenization

Paper • 2406.05797 • Published Jun 9, 2024 • 3

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28, 2025 • 116

upvoted 2 papers 6 months ago

Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning

Paper • 2507.17512 • Published Jul 23, 2025 • 36

REST: Stress Testing Large Reasoning Models by Asking Multiple Problems at Once

Paper • 2507.10541 • Published Jul 14, 2025 • 29

upvoted a collection 6 months ago

AdaptThink

7 items • Updated May 19, 2025 • 2

upvoted an article 7 months ago

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

Article • Mar 20, 2024 • 108

upvoted 2 papers 7 months ago

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

Paper • 2505.22653 • Published May 28, 2025 • 43

rStar-Coder: Scaling Competitive Code Reasoning with a Large-Scale Verified Dataset

Paper • 2505.21297 • Published May 27, 2025 • 29