Zhiwei He's picture

Zhiwei He

zwhe99

·

https://zwhe99.github.io/

AI & ML interests

Natural Language Processing

Recent Activity

upvoted a paper about 2 months ago

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

liked a model about 2 months ago

MiniMaxAI/MiniMax-M2

updated a dataset 2 months ago

zwhe99/lcbv5

View all activity

Organizations

None yet

New activity in zwhe99/DeepMath-103K 8 months ago

You provided three responses for each answer. Which one is correct, or are all of them correct?

#2 opened 9 months ago by

Questions that contain answers

#4 opened 8 months ago by

commented 2 papers 9 months ago

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Paper • 2504.11456 • Published Apr 15 • 12 •

DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning

Paper • 2504.11456 • Published Apr 15 • 12 •

New activity in zwhe99/DeepMath-103K 9 months ago

[bot] Conversion to Parquet

#1 opened 9 months ago by

parquet-converter

add full citation

#3 opened 9 months ago by

commented 12 papers 9 months ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 52 •

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 52 •

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 52 •

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 52 •

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 52 •

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 52 •

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 52 •

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 52 •

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 52 •

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 52 •

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 52 •

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 52 •