Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
12
5
Hanning Zhang
HanningZhang
Follow
RogerZhuo's profile picture
circulartext's profile picture
2 followers
·
10 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
8 days ago
NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems
upvoted
a
paper
13 days ago
PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary
updated
a model
16 days ago
HanningZhang/deepseek_only_conjecture_claude_deepseek_train_data_max1_5e-7_bs32_decay1e-6_2ep_ep1
View all activity
Organizations
HanningZhang
's models
282
Sort: Recently updated
HanningZhang/Qwen-PPO-Selfcorr-Step160
8B
•
Updated
Feb 21, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step150
8B
•
Updated
Feb 21, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step140
8B
•
Updated
Feb 21, 2025
HanningZhang/Qwen-PPO-Selfcorr-Step130
8B
•
Updated
Feb 21, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step120
8B
•
Updated
Feb 21, 2025
HanningZhang/Qwen-PPO-Selfcorr-Step110
8B
•
Updated
Feb 21, 2025
HanningZhang/Qwen-PPO-Selfcorr-Step100
8B
•
Updated
Feb 21, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step90
8B
•
Updated
Feb 21, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step80
8B
•
Updated
Feb 21, 2025
HanningZhang/Qwen-PPO-Selfcorr-Step70
8B
•
Updated
Feb 21, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step60
8B
•
Updated
Feb 21, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step50
8B
•
Updated
Feb 21, 2025
HanningZhang/Qwen-PPO-Selfcorr-Step40
8B
•
Updated
Feb 21, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step30
8B
•
Updated
Feb 21, 2025
HanningZhang/Qwen-PPO-Selfcorr-Step20
8B
•
Updated
Feb 21, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step10
8B
•
Updated
Feb 21, 2025
•
1
HanningZhang/Qwen-PPO-Selfcorr-Step200
8B
•
Updated
Feb 21, 2025
•
1
HanningZhang/Qwen_numina_iter7_new
Text Generation
•
8B
•
Updated
Feb 13, 2025
•
1
HanningZhang/Qwen_numina_iter6_new
Text Generation
•
8B
•
Updated
Feb 13, 2025
HanningZhang/Qwen_numina_iter5_new
Text Generation
•
8B
•
Updated
Feb 11, 2025
HanningZhang/Qwen_numina_iter2_new
Text Generation
•
8B
•
Updated
Feb 11, 2025
HanningZhang/Qwen_numina_iter4_new
Text Generation
•
8B
•
Updated
Feb 11, 2025
HanningZhang/Qwen_numina_iter3_new
Text Generation
•
8B
•
Updated
Feb 11, 2025
HanningZhang/Qwen_numina_iter1_new
Text Generation
•
8B
•
Updated
Feb 11, 2025
HanningZhang/Llama3_numina_iter3
Text Generation
•
8B
•
Updated
Feb 11, 2025
HanningZhang/Llama3_numina_iter2
Text Generation
•
8B
•
Updated
Feb 11, 2025
HanningZhang/Llama3_numina_iter1
Text Generation
•
8B
•
Updated
Feb 11, 2025
HanningZhang/Qwen_onlymath_iter9
Text Generation
•
8B
•
Updated
Feb 10, 2025
HanningZhang/Qwen_onlymath_iter8
Text Generation
•
8B
•
Updated
Feb 10, 2025
HanningZhang/Qwen_onlymath_iter7
Text Generation
•
8B
•
Updated
Feb 10, 2025
•
1
Previous
1
...
5
6
7
8
9
10
Next