|
|
--- |
|
|
library_name: ml-agents |
|
|
tags: |
|
|
- SnowballTarget |
|
|
- deep-reinforcement-learning |
|
|
- reinforcement-learning |
|
|
- ML-Agents-SnowballTarget |
|
|
--- |
|
|
|
|
|
# **ppo** Agent playing **SnowballTarget** |
|
|
This is a trained model of a **ppo** agent playing **SnowballTarget** |
|
|
using the [Unity ML-Agents Library](https://github.com/Unity-Technologies/ml-agents). |
|
|
|
|
|
## Results |
|
|
-[INFO] SnowballTarget. |
|
|
-Step: 400000. |
|
|
-Time Elapsed: 903.639 s. |
|
|
-Mean Reward: 25.591. |
|
|
-Std of Reward: 1.992. |
|
|
|
|
|
## Hyperparameters |
|
|
%%file /content/ml-agents/config/ppo/SnowballTarget.yaml |
|
|
```yaml |
|
|
behaviors: |
|
|
SnowballTarget: |
|
|
trainer_type: ppo |
|
|
summary_freq: 10000 |
|
|
keep_checkpoints: 10 |
|
|
checkpoint_interval: 50000 |
|
|
max_steps: 400000 |
|
|
time_horizon: 32 |
|
|
threaded: true |
|
|
hyperparameters: |
|
|
learning_rate: 0.0003 |
|
|
learning_rate_schedule: linear |
|
|
batch_size: 128 |
|
|
buffer_size: 2048 |
|
|
beta: 0.005 |
|
|
epsilon: 0.2 |
|
|
lambd: 0.95 |
|
|
num_epoch: 3 |
|
|
network_settings: |
|
|
normalize: false |
|
|
hidden_units: 256 |
|
|
num_layers: 3 |
|
|
vis_encode_type: nature_cnn |
|
|
reward_signals: |
|
|
extrinsic: |
|
|
gamma: 0.9 |
|
|
strength: 1.0 |
|
|
``` |
|
|
|
|
|
### Resume the training |
|
|
```bash |
|
|
mlagents-learn <your_configuration_file_path.yaml> --run-id=<run_id> --resume |
|
|
``` |
|
|
|
|
|
### Watch your Agent play |
|
|
You can watch your agent **playing directly in your browser** |
|
|
|
|
|
1. If the environment is part of ML-Agents official environments, go to https://huggingface.co/unity |
|
|
2. Step 1: Find your model_id: enrique2701/ppo-SnowballTarget |
|
|
3. Step 2: Select your *.nn /*.onnx file |
|
|
4. Click on Watch the agent play 👀 |
|
|
|