Hanning Zhang
HanningZhang
2 followers · 10 following
AI & ML interests: None yet
Recent Activity
upvoted a paper 4 days ago: CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents
upvoted a paper 28 days ago: GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving
upvoted a paper 28 days ago: ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning
HanningZhang's datasets (233)
Sort: Recently updated
HanningZhang/OpenGenAlign-v2 • Viewer • Updated Sep 30 • 43.5k • 8
HanningZhang/RAG-Reward-Modeling-v2 • Viewer • Updated Sep 30 • 43.5k • 6
HanningZhang/scalebio_distill_qwen_math • Viewer • Updated Sep 23 • 2k • 11
HanningZhang/test-self-rewarding • Viewer • Updated Sep 4 • 40k • 3
HanningZhang/test-no-self-rewarding • Viewer • Updated Sep 4 • 40k • 9
HanningZhang/MLE-Policy-Trajectory • Viewer • Updated Jul 8 • 1.22k • 22
HanningZhang/MLE-Reward-Rating • Viewer • Updated Jul 8 • 1.86k • 19
HanningZhang/mistral1-selected-baseline • Viewer • Updated May 4 • 3k • 3
HanningZhang/llama32-selected-baseline • Viewer • Updated May 4 • 3k • 8
HanningZhang/scalebio_reasoning_think_220k_with_system_and_cot • Viewer • Updated Apr 22 • 193k • 6
HanningZhang/scalebio_reasoning_nonthink_50k_with_system_and_cot • Viewer • Updated Apr 19 • 50k • 13
HanningZhang/scalebio_reasoning_nonthink_20k_with_system_and_cot • Viewer • Updated Apr 19 • 20k • 4
HanningZhang/scalebio_reasoning_think_20k • Viewer • Updated Apr 16 • 20k • 8
HanningZhang/scalebio_reasoning_think_50k • Viewer • Updated Apr 16 • 50k • 14
HanningZhang/scalebio_reasoning_think_100k • Viewer • Updated Apr 16 • 100k • 10
HanningZhang/scalebio_reasoning_nonthink_200k • Viewer • Updated Apr 16 • 200k • 15
HanningZhang/scalebio_reasoning_nonthink_100k • Viewer • Updated Apr 16 • 100k • 15
HanningZhang/scalebio_reasoning_nonthink_50k • Viewer • Updated Apr 16 • 50k • 16
HanningZhang/scalebio_reasoning_nonthink_20k • Viewer • Updated Apr 16 • 20k • 14
HanningZhang/scalebio_reasoning_think_200k • Viewer • Updated Apr 16 • 133k • 11
HanningZhang/scalebio_original_reasoning • Viewer • Updated Apr 13 • 3.4k • 2
HanningZhang/scalebio_reasoning_nonthink • Viewer • Updated Apr 13 • 2k • 5
HanningZhang/scalebio_reasoning_think • Viewer • Updated Apr 13 • 2k • 8
HanningZhang/UltraFeedback_eval • Viewer • Updated Apr 11 • 1.56k • 1
HanningZhang/scalebio_llama_math_1.5k_scalebio_1ep • Viewer • Updated Apr 2 • 21.4k • 3
HanningZhang/scalebio_qwen_math_1.5k_scalebio_1ep • Viewer • Updated Apr 1 • 21.4k • 10
HanningZhang/scalebio_qwen_math_1.5k_scalebio • Viewer • Updated Apr 1 • 21.4k • 11
HanningZhang/scalebio_llama_math_100k_less • Viewer • Updated Mar 31 • 101k • 4
HanningZhang/scalebio_llama_math_100k_rho • Viewer • Updated Mar 31 • 101k • 3
HanningZhang/scalebio_llama_math_100k_scalebio • Viewer • Updated Mar 30 • 101k • 3