Hanning Zhang's picture

12 5

Hanning Zhang

HanningZhang

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems

upvoted a paper 7 days ago

PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary

updated a model 11 days ago

HanningZhang/deepseek_only_conjecture_claude_deepseek_train_data_max1_5e-7_bs32_decay1e-6_2ep_ep1

View all activity

Organizations

HanningZhang 's datasets 233

HanningZhang/OpenGenAlign-v2

Viewer • Updated Sep 30, 2025 • 43.5k • 7

HanningZhang/RAG-Reward-Modeling-v2

Viewer • Updated Sep 30, 2025 • 43.5k • 10

HanningZhang/scalebio_distill_qwen_math

Viewer • Updated Sep 23, 2025 • 2k • 6

HanningZhang/test-self-rewarding

Viewer • Updated Sep 4, 2025 • 40k • 6

HanningZhang/test-no-self-rewarding

Viewer • Updated Sep 4, 2025 • 40k • 3

HanningZhang/MLE-Policy-Trajectory

Viewer • Updated Jul 8, 2025 • 1.22k • 3

HanningZhang/MLE-Reward-Rating

Viewer • Updated Jul 8, 2025 • 1.86k • 3

HanningZhang/mistral1-selected-baseline

Viewer • Updated May 4, 2025 • 3k • 1

HanningZhang/llama32-selected-baseline

Viewer • Updated May 4, 2025 • 3k • 3

HanningZhang/scalebio_reasoning_think_220k_with_system_and_cot

Viewer • Updated Apr 22, 2025 • 193k • 3

HanningZhang/scalebio_reasoning_nonthink_50k_with_system_and_cot

Viewer • Updated Apr 19, 2025 • 50k • 3

HanningZhang/scalebio_reasoning_nonthink_20k_with_system_and_cot

Viewer • Updated Apr 19, 2025 • 20k • 3

HanningZhang/scalebio_reasoning_think_20k

Viewer • Updated Apr 16, 2025 • 20k

HanningZhang/scalebio_reasoning_think_50k

Viewer • Updated Apr 16, 2025 • 50k

HanningZhang/scalebio_reasoning_think_100k

Viewer • Updated Apr 16, 2025 • 100k • 11

HanningZhang/scalebio_reasoning_nonthink_200k

Viewer • Updated Apr 16, 2025 • 200k

HanningZhang/scalebio_reasoning_nonthink_100k

Viewer • Updated Apr 16, 2025 • 100k

HanningZhang/scalebio_reasoning_nonthink_50k

Viewer • Updated Apr 16, 2025 • 50k • 5

HanningZhang/scalebio_reasoning_nonthink_20k

Viewer • Updated Apr 16, 2025 • 20k • 1

HanningZhang/scalebio_reasoning_think_200k

Viewer • Updated Apr 16, 2025 • 133k • 3

HanningZhang/scalebio_original_reasoning

Viewer • Updated Apr 13, 2025 • 3.4k

HanningZhang/scalebio_reasoning_nonthink

Viewer • Updated Apr 13, 2025 • 2k

HanningZhang/scalebio_reasoning_think

Viewer • Updated Apr 13, 2025 • 2k • 1

HanningZhang/UltraFeedback_eval

Viewer • Updated Apr 11, 2025 • 1.56k

HanningZhang/scalebio_llama_math_1.5k_scalebio_1ep

Viewer • Updated Apr 2, 2025 • 21.4k • 4

HanningZhang/scalebio_qwen_math_1.5k_scalebio_1ep

Viewer • Updated Apr 1, 2025 • 21.4k • 6

HanningZhang/scalebio_qwen_math_1.5k_scalebio

Viewer • Updated Apr 1, 2025 • 21.4k • 2

HanningZhang/scalebio_llama_math_100k_less

Viewer • Updated Mar 31, 2025 • 101k • 7

HanningZhang/scalebio_llama_math_100k_rho

Viewer • Updated Mar 31, 2025 • 101k • 5

HanningZhang/scalebio_llama_math_100k_scalebio

Viewer • Updated Mar 30, 2025 • 101k • 2