Hanning Zhang
HanningZhang
2 followers · 10 following
AI & ML interests
None yet
Recent Activity
Upvoted a paper 10 days ago: NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems
Upvoted a paper 15 days ago: PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary
Updated a model 18 days ago: HanningZhang/deepseek_only_conjecture_claude_deepseek_train_data_max1_5e-7_bs32_decay1e-6_2ep_ep1
Organizations
HanningZhang's models (282)
HanningZhang/Qwen-PPO-Selfcorr-Step150-Vanilla • 8B • Updated Feb 23, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step140-Vanilla • 8B • Updated Feb 23, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step130-Vanilla • 8B • Updated Feb 23, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step120-Vanilla • 8B • Updated Feb 23, 2025
HanningZhang/Qwen-PPO-Selfcorr-Step110-Vanilla • 8B • Updated Feb 23, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step100-Vanilla • 8B • Updated Feb 23, 2025
HanningZhang/Qwen-PPO-Selfcorr-Step90-Vanilla • 8B • Updated Feb 23, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step80-Vanilla • 8B • Updated Feb 23, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step70-Vanilla • 8B • Updated Feb 23, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step60-Vanilla • 8B • Updated Feb 23, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step50-Vanilla • 8B • Updated Feb 23, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step40-Vanilla • 8B • Updated Feb 23, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step30-Vanilla • 8B • Updated Feb 23, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step20-Vanilla • 8B • Updated Feb 23, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step10-Vanilla • 8B • Updated Feb 23, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step300-Vanilla • 8B • Updated Feb 23, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step80-startfrom-noselfcorr-step100 • 8B • Updated Feb 22, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step70-startfrom-noselfcorr-step100 • 8B • Updated Feb 22, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step60-startfrom-noselfcorr-step100 • 8B • Updated Feb 22, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step50-startfrom-noselfcorr-step100 • 8B • Updated Feb 22, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step40-startfrom-noselfcorr-step100 • 8B • Updated Feb 22, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step30-startfrom-noselfcorr-step100 • 8B • Updated Feb 22, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step20-startfrom-noselfcorr-step100 • 8B • Updated Feb 22, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step10-startfrom-noselfcorr-step100 • 8B • Updated Feb 22, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step230 • 8B • Updated Feb 21, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step220 • 8B • Updated Feb 21, 2025
HanningZhang/Qwen-PPO-Selfcorr-Step210 • 8B • Updated Feb 21, 2025
HanningZhang/Qwen-PPO-Selfcorr-Step190 • 8B • Updated Feb 21, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step180 • 8B • Updated Feb 21, 2025 • 1
HanningZhang/Qwen-PPO-Selfcorr-Step170 • 8B • Updated Feb 21, 2025 • 1