Stable-Baselines-Team / stable-baselines
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
☆298Updated 2 years ago
Alternatives and similar repositories for stable-baselines
Users that are interested in stable-baselines are comparing it to the libraries listed below
Sorting:
- A repository of high-performing hierarchical reinforcement learning models and algorithms.☆305Updated 2 years ago
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games☆546Updated last year
- PyTorch implementation of Soft Actor-Critic (SAC)☆542Updated 3 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆489Updated 2 years ago
- Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code☆588Updated last month
- Code for conservative Q-learning☆438Updated 3 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆315Updated 3 years ago
- Reinforcement learning library(framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ...☆408Updated 3 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆628Updated 4 years ago
- PyTorch implementation of SAC-Discrete.☆302Updated 9 months ago
- Colab notebooks part of the documentation of Stable Baselines reinforcement learning library☆223Updated 3 months ago
- Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)☆750Updated last year
- DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (…☆472Updated last year
- Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019☆667Updated last year
- DeepRL algorithms implementation easy for understanding and reading with Pytorch and Tensorflow 2(DQN, REINFORCE, VPG, A2C, TRPO, PPO, DD…☆333Updated 2 years ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆336Updated 5 years ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆350Updated 2 years ago
- Tools for accelerating safe exploration research.☆534Updated 2 years ago
- Keeping track of RL experiments☆161Updated 2 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆359Updated 3 years ago
- Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019☆739Updated 2 years ago
- Dream to Control: Learning Behaviors by Latent Imagination☆536Updated 3 years ago
- Code for the paper "Phasic Policy Gradient"☆261Updated 2 years ago
- Imitation learning algorithms☆525Updated last month
- PyTorch implementation of FQF, IQN and QR-DQN.☆176Updated 9 months ago
- A parallel framework for population-based multi-agent reinforcement learning.☆528Updated last year
- A collection of reference environments for offline reinforcement learning☆1,492Updated 5 months ago
- Reinforcement Learning Algorithms Based on PyTorch☆449Updated 3 years ago
- A collection of multi agent environments based on OpenAI gym.☆597Updated 10 months ago
- Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.☆425Updated 2 years ago