Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
s
august66
Follow
callmespring's profile picture
Kyleyee's profile picture
2 followers
·
2 following
AI & ML interests
None yet
Organizations
models
5
Sort: Recently updated
august66/hh_qwen1.5_drpo
2B
•
Updated
Oct 11
•
9
august66/hh_qwen_1.5b_dpo_model_2
Text Generation
•
2B
•
Updated
Sep 9
•
4
august66/ultrafeedback_qwen_1.5b_drpo_model
Updated
Jul 9
august66/qwen2-sft-dpo-imdb-beta-1.0
Updated
Jun 2
august66/qwen2-sft-final
Text Generation
•
0.5B
•
Updated
Jun 1
•
33
datasets
26
Sort: Recently updated
august66/drpo_hh_qwen2.5_1.5b_with_ref_btpref
Viewer
•
Updated
Oct 8
•
48.8k
•
30
august66/hh_qwen2.5_1.5b_with_bias_bt_pref
Viewer
•
Updated
Oct 2
•
18k
•
20
august66/hh_qwen2.5_1.5b_with_bias
Viewer
•
Updated
Sep 27
•
18k
•
23
august66/drpo_hh_qwen2.5_1.5b
Viewer
•
Updated
Sep 8
•
43.8k
•
19
august66/dpo_reward_dist_pi_theta_prompt_3
Viewer
•
Updated
Sep 3
•
5k
•
16
august66/dpo_reward_dist_pi_theta_prompt_2
Viewer
•
Updated
Sep 3
•
5k
•
15
august66/dpo_reward_dist_pi_theta
Viewer
•
Updated
Aug 23
•
5k
•
10
august66/reward_distribution_2_tldr_openassist_pi_ref
Viewer
•
Updated
Aug 4
•
5k
•
7
august66/reward_distribution_2_tldr_openassist_pi_theta
Viewer
•
Updated
Aug 4
•
5k
•
12
august66/reward_distribution_tldr_openassist_pi_theta
Viewer
•
Updated
Jul 30
•
5k
•
10
View 26 datasets