Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
12
5
Hanning Zhang
HanningZhang
Follow
RogerZhuo's profile picture
circulartext's profile picture
2 followers
·
10 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
8 days ago
NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems
upvoted
a
paper
12 days ago
PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary
updated
a model
16 days ago
HanningZhang/deepseek_only_conjecture_claude_deepseek_train_data_max1_5e-7_bs32_decay1e-6_2ep_ep1
View all activity
Organizations
HanningZhang
's models
282
Sort: Recently updated
HanningZhang/Qwen-PPO-Selfcorr-Step20-Rebuttal-kumar
8B
•
Updated
Mar 27, 2025
•
2
HanningZhang/Qwen-PPO-Selfcorr-Step180-Rebuttal
8B
•
Updated
Mar 27, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step160-Rebuttal
8B
•
Updated
Mar 27, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step140-Rebuttal
8B
•
Updated
Mar 27, 2025
HanningZhang/Qwen-PPO-Selfcorr-Step120-Rebuttal
8B
•
Updated
Mar 27, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step100-Rebuttal
8B
•
Updated
Mar 27, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step80-Rebuttal
8B
•
Updated
Mar 27, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step60-Rebuttal
8B
•
Updated
Mar 27, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step40-Rebuttal
8B
•
Updated
Mar 27, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step20-Rebuttal
8B
•
Updated
Mar 27, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step300-Rebuttal
8B
•
Updated
Mar 26, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step280-Rebuttal
8B
•
Updated
Mar 26, 2025
HanningZhang/Qwen-PPO-Selfcorr-Step260-Rebuttal
8B
•
Updated
Mar 26, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step240-Rebuttal
8B
•
Updated
Mar 26, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step220-Rebuttal
8B
•
Updated
Mar 26, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step200-Rebuttal
8B
•
Updated
Mar 26, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step290-Vanilla
8B
•
Updated
Feb 24, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step280-Vanilla
8B
•
Updated
Feb 24, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step270-Vanilla
8B
•
Updated
Feb 24, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step260-Vanilla
8B
•
Updated
Feb 24, 2025
HanningZhang/Qwen-PPO-Selfcorr-Step250-Vanilla
8B
•
Updated
Feb 24, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step240-Vanilla
8B
•
Updated
Feb 24, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step230-Vanilla
8B
•
Updated
Feb 24, 2025
HanningZhang/Qwen-PPO-Selfcorr-Step220-Vanilla
8B
•
Updated
Feb 24, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step210-Vanilla
8B
•
Updated
Feb 24, 2025
HanningZhang/Qwen-PPO-Selfcorr-Step200-Vanilla
8B
•
Updated
Feb 24, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step190-Vanilla
8B
•
Updated
Feb 24, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step180-Vanilla
8B
•
Updated
Feb 24, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step170-Vanilla
8B
•
Updated
Feb 23, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step160-Vanilla
8B
•
Updated
Feb 23, 2025
•
1
Previous
1
...
3
4
5
6
7
...
10
Next