AI & ML interests
None defined yet.
Recent Activity
adalberto-temp/llama_3b_apo_zero_gsm8k_it2
Text Generation
•
3B
•
Updated
•
4
adalberto-temp/llama_3b_apo_zero_math
Text Generation
•
3B
•
Updated
•
15
adalberto-temp/llama_3b_apo_zero_gsm8k
Text Generation
•
3B
•
Updated
•
24
adalberto-temp/llama_3b_dpop_gsm8k
Text Generation
•
3B
•
Updated
•
17
adalberto-temp/llama_3b_dpop_tulu
Text Generation
•
3B
•
Updated
•
17
adalberto-temp/llama_3b_mix_dpop_4k
Text Generation
•
3B
•
Updated
•
17
adalberto-temp/llama_3b_mix_dpo_sft
Text Generation
•
3B
•
Updated
•
3
adalberto-temp/llama_3b_mix_dpo
Text Generation
•
3B
•
Updated
•
4
adalberto-temp/llama-3b-open-dpo-sft
Text Generation
•
3B
•
Updated
•
3
adalberto-temp/llama-3b-open-dpo
Text Generation
•
3B
•
Updated
•
4
adalberto-temp/llama-3b-gold-onpolicy-mix-teacher-8b
Text Generation
•
3B
•
Updated
•
2
adalberto-temp/Llama-3.2-3B-Instruct-GOLD
Text Generation
•
3B
•
Updated
•
5
adalberto-temp/llama-3b-gold-onpolicy-mix
Text Generation
•
3B
•
Updated
•
7
adalberto-temp/llama-3b-gold-onpolicy-gsm8k
Text Generation
•
3B
•
Updated
•
5
adalberto-temp/llama-3b-gsm8k-dpo
Text Generation
•
3B
•
Updated
•
3
adalberto-temp/llama-3b-enem-synth-dpo
Text Generation
•
3B
•
Updated
•
3
adalberto-temp/energy_slerp_gsm8k_dpo
Text Generation
•
3B
•
Updated
•
3
adalberto-temp/energy_kl_v0
Text Generation
•
3B
•
Updated
•
5
adalberto-temp/student-distill-online-dpo
Text Generation
•
3B
•
Updated
•
4
adalberto-temp/energy_sft_magpie_v4
Text Generation
•
3B
•
Updated
•
5
adalberto-temp/energy_instruct_3b_slerp
Text Generation
•
3B
•
Updated
•
9
adalberto-temp/energy_sft_magpie_v3
Text Generation
•
3B
•
Updated
•
10
adalberto-temp/energy_sft_magpie_v2
Text Generation
•
3B
•
Updated
•
2
adalberto-temp/energy_sft_5
Text Generation
•
3B
•
Updated
•
3
adalberto-temp/energy_sft_magpie
Text Generation
•
3B
•
Updated
•
4
adalberto-temp/energy_instruct_3b
Text Generation
•
3B
•
Updated
•
4
adalberto-temp/energy_sft_4
Text Generation
•
3B
•
Updated
•
3
adalberto-temp/energy_sft_2
Text Generation
•
3B
•
Updated
•
6
adalberto-temp/energy_base
Text Generation
•
3B
•
Updated
•
7
adalberto-temp/energy_apo_V0.1_long
Text Generation
•
3B
•
Updated
•
6