Azal Ahmad Khan's picture

1

Azal Ahmad Khan

AzalKhan

AI & ML interests

None yet

Recent Activity

updated a model 28 days ago

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_1176_FlashRL_G4-L2048_new

published a model 28 days ago

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_1176_FlashRL_G4-L2048_new

updated a model 28 days ago

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_882_FlashRL_G4-L2048_new

View all activity

Organizations

None yet

AzalKhan 's models 19

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_1176_FlashRL_G4-L2048_new

Reinforcement Learning • 2B • Updated 28 days ago • 510

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_882_FlashRL_G4-L2048_new

Reinforcement Learning • 2B • Updated 28 days ago • 362

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_588_FlashRL_G4-L2048_new

Reinforcement Learning • 2B • Updated 28 days ago • 364

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_294_FlashRL_G4-L2048_new

Reinforcement Learning • 2B • Updated 28 days ago • 505

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_1176_FlashRL_G4-L1024

Reinforcement Learning • 2B • Updated 29 days ago • 163

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_882_FlashRL_G4-L1024

Reinforcement Learning • 2B • Updated 29 days ago • 25

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_588_FlashRL_G4-L1024

Reinforcement Learning • 2B • Updated 29 days ago • 22

AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_294_FlashRL_G4-L1024

Reinforcement Learning • 2B • Updated 29 days ago • 13

AzalKhan/Qwen2.5-1.5B-Instruct_open-r1-DAPO-Math-17k-Processed_294

Reinforcement Learning • 2B • Updated Oct 10 • 16

AzalKhan/Qwen2.5-1.5B_open-r1-DAPO-Math-17k-Processed_882

Reinforcement Learning • 2B • Updated Oct 8 • 13

AzalKhan/Qwen2.5-1.5B_open-r1-DAPO-Math-17k-Processed_588

Reinforcement Learning • 2B • Updated Oct 8 • 5

AzalKhan/Qwen2.5-1.5B_open-r1-DAPO-Math-17k-Processed_294

Reinforcement Learning • 2B • Updated Oct 8 • 9

AzalKhan/Qwen2.5-1.5B-Instruct_open-r1-DAPO-Math-17k-Processed_882

Reinforcement Learning • 2B • Updated Oct 7 • 297

AzalKhan/Qwen2.5-1.5B-Instruct_open-r1-DAPO-Math-17k-Processed_588

Reinforcement Learning • 2B • Updated Oct 5 • 7

AzalKhan/Qwen2.5-1.5B-Instruct_open-r1-DAPO-Math-17k-Processed_1

Reinforcement Learning • 2B • Updated Oct 3 • 1

AzalKhan/stableLM_ft_dpo_fin

Text Generation • 4B • Updated Mar 29, 2024

AzalKhan/mistral_ft_dpo_fin

Text Generation • 4B • Updated Mar 28, 2024

AzalKhan/gpt2_dpo

Text Generation • 0.1B • Updated Mar 28, 2024

AzalKhan/vicuna_ft_dpo_fin

Text Generation • 4B • Updated Mar 27, 2024