Transformers
Safetensors
unsloth
ping98k's picture
Update README.md
f3150d2 verified
---
library_name: transformers
tags:
- unsloth
datasets:
- amityco/tau-bench-retail-train-next-action-medium
---
```
tau train medium 100 sample
β”‚ πŸ† Average Reward: 0.5175 β”‚
β”‚ β”‚
β”‚ πŸ“ˆ Pass^k Metrics: β”‚
β”‚ k=1: 0.518 β”‚
β”‚ k=2: 0.421
tau train medium 200 sample
β”‚ πŸ† Average Reward: 0.5526 β”‚
β”‚ β”‚
β”‚ πŸ“ˆ Pass^k Metrics: β”‚
β”‚ k=1: 0.553 β”‚
β”‚ k=2: 0.482
tau train medium 300 sample
β”‚ πŸ† Average Reward: 0.5175 β”‚
β”‚ β”‚
β”‚ πŸ“ˆ Pass^k Metrics: β”‚
β”‚ k=1: 0.518 β”‚
β”‚ k=2: 0.421
```
![image](https://cdn-uploads.huggingface.co/production/uploads/64739bc371f07ae738d2d61d/-efahsrelKuAcCpzzGrtv.png)
![image](https://cdn-uploads.huggingface.co/production/uploads/64739bc371f07ae738d2d61d/ecldtyxoLdRyUkhwSpdUg.png)