1 34 119

Peng Wang

stillarrow

https://peter-peng-w.github.io/

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

nvidia/Nemotron-Research-Reasoning-Qwen-1.5B

liked a dataset 9 days ago

open-r1/DAPO-Math-17k-Processed

upvoted a paper about 1 month ago

ExGRPO: Learning to Reason from Experience

View all activity

Organizations

None yet

liked a model 2 days ago

nvidia/Nemotron-Research-Reasoning-Qwen-1.5B

Text Generation • 2B • Updated Aug 12 • 10.7k • 228

liked a dataset 9 days ago

open-r1/DAPO-Math-17k-Processed

Viewer • Updated 6 days ago • 34.8k • 5.08k • 48

upvoted 2 papers about 1 month ago

ExGRPO: Learning to Reason from Experience

Paper • 2510.02245 • Published Oct 2 • 78

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30 • 54

liked 3 models about 1 month ago

liked a dataset about 1 month ago

jupyter-agent/jupyter-agent-dataset

Viewer • Updated Sep 10 • 95.8k • 1.54k • 149

liked a model about 1 month ago

jinaai/jina-embeddings-v4

Visual Document Retrieval • 4B • Updated Sep 2 • 77.9k • 412

upvoted a collection about 1 month ago

Qwen3-VL

Collection

37 items • Updated 14 days ago • 403

liked 2 models about 1 month ago

Qwen/Qwen3-VL-235B-A22B-Thinking

Image-Text-to-Text • 236B • Updated Oct 4 • 11.4k • • 319

Alibaba-NLP/gme-Qwen2-VL-2B-Instruct

upvoted a paper about 2 months ago

VCRL: Variance-based Curriculum Reinforcement Learning for Large Language Models

Paper • 2509.19803 • Published Sep 24 • 118

updated a dataset about 2 months ago

stillarrow/MATH

Viewer • Updated Sep 25 • 26.5k • 16

published a dataset about 2 months ago

stillarrow/MATH

Viewer • Updated Sep 25 • 26.5k • 16

liked 3 datasets about 2 months ago

Jackrong/GPT-OSS-120B-Distilled-Reasoning-math

Viewer • Updated Aug 17 • 8.47k • 124 • 7

HuggingFaceH4/MATH-500

Viewer • Updated Nov 15, 2024 • 500 • 64.5k • 206

BytedTsinghua-SIA/DAPO-Math-17k

Viewer • Updated Apr 18 • 1.79M • 8.13k • 119

liked 2 models about 2 months ago

deepseek-ai/DeepSeek-V3.1-Terminus

Text Generation • 685B • Updated Sep 29 • 56.2k • • 344

deepseek-ai/DeepSeek-V3.1

Text Generation • 685B • Updated Sep 5 • 387k • • 801

Peng Wang

AI & ML interests

Recent Activity

Organizations

stillarrow's activity