PPO Agent Playing CartPole-v1

Trained with a minimal CleanRL-style PPO implementation in Google Colab.

Results

  • Mean reward: 59.50
  • Std reward: 31.81
Downloads last month

-

Downloads are not tracked for this model. How to track
Video Preview
loading

Evaluation results