Transformers
Safetensors
unsloth

tau train hard (user gpt4.1)

retail β”‚ πŸ† Average Reward: 0.6579 β”‚
β”‚ πŸ“ˆ Pass^k Metrics:
β”‚ k=1: 0.658
β”‚ k=2: 0.535

image

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Dataset used to train amityco/Qwen3-4B-Thinking-2507-tau-train-v0.5