--- library_name: transformers tags: - unsloth --- tau train hard no sys (user gpt4.1) retail │ 🏆 Average Reward: 0.5702 │ │ 📈 Pass^k Metrics: │ k=1: 0.570 │ k=2: 0.439 ![image](https://cdn-uploads.huggingface.co/production/uploads/64739bc371f07ae738d2d61d/NKYF6cUPOrVYZMKeDnPnW.png)