Transformers
Safetensors
unsloth

image

round1
β”‚ πŸ† Average Reward: 0.3465                                                                                                                                                
β”‚ πŸ“ˆ Pass^k Metrics:                                                                                                                                                       
β”‚ k=1: 0.346                                                                                                                                                               
β”‚ k=2: 0.263   

round2
β”‚ πŸ† Average Reward: 0.3246                                                                                                                                                                                                                                                                                                                        
β”‚ πŸ“ˆ Pass^k Metrics:                                                                                                                                                       
β”‚ k=1: 0.325                                                                                                                                                               
β”‚ k=2: 0.219
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Dataset used to train amityco/Qwen3-4B-Thinking-2507-tau-train-v0.6