Stable-Baselines3 Docs - Reliable Reinforcement Learning Implementations

Stable Baselines3 (SB3) is a set of reliable implementations of reinforcement learning algorithms in PyTorch. It is the next major version of Stable Baselines.

Github repository:


RL Baselines3 Zoo (training framework for SB3):

RL Baselines3 Zoo provides a collection of pre-trained agents, scripts for training, evaluating agents, tuning hyperparameters, plotting results and recording videos.

SB3 Contrib (experimental RL code, latest algorithms):

Main Features

  • Unified structure for all algorithms

  • PEP8 compliant (unified code style)

  • Documented functions and classes

  • Tests, high code coverage and type hints

  • Clean code

  • Tensorboard support

  • The performance of each algorithm was tested (see Results section in their respective page)

User Guide

Citing Stable Baselines3

To cite this project in publications:

  author  = {Antonin Raffin and Ashley Hill and Adam Gleave and Anssi Kanervisto and Maximilian Ernestus and Noah Dormann},
  title   = {Stable-Baselines3: Reliable Reinforcement Learning Implementations},
  journal = {Journal of Machine Learning Research},
  year    = {2021},
  volume  = {22},
  number  = {268},
  pages   = {1-8},
  url     = {}


To any interested in making the rl baselines better, there are still some improvements that need to be done. You can check issues in the repo.

If you want to contribute, please read first.

Indices and tables