Stable Baselines3 is a set of improved implementations of reinforcement learning algorithms in PyTorch. It is the next major version of Stable Baselines.

GitHub repository: https://github.com/DLR-RM/stable-baselines3

RL Baselines3 Zoo (collection of pre-trained agents): https://github.com/DLR-RM/rl-baselines3-zoo

RL Baselines3 Zoo also offers a simple interface for training and evaluating agents and for tuning hyperparameters.

Main Features

  • Unified structure for all algorithms

  • PEP8 compliant (unified code style)

  • Documented functions and classes

  • Tests, high code coverage and type hints

  • Clean code

  • Tensorboard support
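
To illustrate what a unified structure means in practice, here is a minimal sketch that trains and evaluates an agent through the same calls regardless of the algorithm chosen. The helper name `train_and_evaluate`, the `DISCRETE_ALGOS` tuple, the `CartPole-v1` environment, and the timestep budget are assumptions made for this example, not part of the library. It assumes `stable-baselines3` is installed, and the optional `tensorboard_log` directory additionally requires TensorBoard.

```python
# Illustrative subset of algorithms that support CartPole's discrete actions.
DISCRETE_ALGOS = ("A2C", "DQN", "PPO")


def train_and_evaluate(algo_name="PPO", env_id="CartPole-v1",
                       total_timesteps=10_000, tensorboard_log=None):
    """Train one agent and return (mean_reward, std_reward) over 5 episodes.

    Hypothetical helper written for this example.
    """
    if algo_name not in DISCRETE_ALGOS:
        raise ValueError(f"algo_name must be one of {DISCRETE_ALGOS}")

    # Imported lazily so the module loads even when SB3 is not installed.
    import stable_baselines3 as sb3
    from stable_baselines3.common.evaluation import evaluate_policy

    # Every algorithm class takes a policy name and an environment (or its id).
    algo_cls = getattr(sb3, algo_name)
    model = algo_cls("MlpPolicy", env_id, verbose=0,
                     tensorboard_log=tensorboard_log)

    # The training call is the same for every algorithm.
    model.learn(total_timesteps=total_timesteps)

    # Evaluate on the vectorized environment the model was trained on.
    return evaluate_policy(model, model.get_env(), n_eval_episodes=5)
```

Calling `train_and_evaluate("A2C")` or `train_and_evaluate("DQN")` switches the algorithm without changing any other line. Passing a directory such as `tensorboard_log="./tb_logs/"` (a placeholder path) writes training curves that TensorBoard can display.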

RL Algorithms

Citing Stable Baselines3

To cite this project in publications:

For anyone interested in improving the RL baselines, some improvements still need to be made. You can check the open issues in the repository.

If you want to contribute, please read CONTRIBUTING.md first.

