Stable-Baselines-Team / stable-baselines
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
☆300 Updated 2 years ago
Alternatives and similar repositories for stable-baselines
Users who are interested in stable-baselines compare it to the libraries listed below.
- Code for conservative Q-learning ☆446 Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic (SAC) ☆545 Updated 3 years ago
- PyTorch implementation of SAC-Discrete. ☆302 Updated 10 months ago
- Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code ☆598 Updated 2 weeks ago
- A repository of high-performing hierarchical reinforcement learning models and algorithms. ☆314 Updated 2 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆495 Updated 2 years ago
- Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL) ☆753 Updated last year
- Colab notebooks part of the documentation of Stable Baselines reinforcement learning library ☆227 Updated 4 months ago
- Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019 ☆678 Updated last year
- Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper. ☆429 Updated 2 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments ☆317 Updated 3 years ago
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games ☆548 Updated last year
- Author's PyTorch implementation of BCQ for continuous and discrete actions ☆631 Updated 4 years ago
- Prioritized Experience Replay (PER) implementation in PyTorch ☆344 Updated 5 years ago
- Tools for accelerating safe exploration research. ☆538 Updated 2 years ago
- Reinforcement learning library (framework) designed for PyTorch, implements DQN, DDPG, A2C, PPO, SAC, MADDPG, A3C, APEX, IMPALA ... ☆409 Updated 3 years ago
- PyTorch implementation of FQF, IQN and QR-DQN. ☆177 Updated 10 months ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL ☆361 Updated 3 years ago
- SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms ☆455 Updated 2 weeks ago
- A collection of multi agent environments based on OpenAI gym. ☆601 Updated 10 months ago
- Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in Open… ☆274 Updated 2 years ago
- Multi-objective Gymnasium environments for reinforcement learning ☆326 Updated 3 months ago
- A collection of reference environments for offline reinforcement learning ☆1,505 Updated 6 months ago
- JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces. ☆684 Updated 2 years ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments. ☆352 Updated 2 years ago
- Code for the paper "Phasic Policy Gradient" ☆261 Updated 2 years ago
- A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities ☆393 Updated last week
- A library for ready-made reinforcement learning agents and reusable components for neat prototyping ☆300 Updated last year
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL… ☆289 Updated 4 years ago
- Deep RL algorithm implementations that are easy to read and understand, in PyTorch and TensorFlow 2 (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DD… ☆333 Updated 2 years ago