openai / safety-starter-agents
Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.
☆412Updated last year
Alternatives and similar repositories for safety-starter-agents:
Users that are interested in safety-starter-agents are comparing it to the libraries listed below
- Tools for accelerating safe exploration research.☆521Updated last year
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆349Updated 3 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆304Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic (SAC)☆532Updated 3 years ago
- A repository of high-performing hierarchical reinforcement learning models and algorithms.☆295Updated last year
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆286Updated 4 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆487Updated 2 years ago
- Code for conservative Q-learning☆426Updated 3 years ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆345Updated 2 years ago
- PyTorch implementation of SAC-Discrete.☆299Updated 7 months ago
- NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark☆435Updated 3 weeks ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆260Updated 4 years ago
- Keeping track of RL experiments☆162Updated 2 years ago
- An extension of the PyMARL codebase that includes additional algorithms and environment support☆582Updated 5 months ago
- PyTorch Implementation of MADDPG (Lowe et. al. 2017)☆613Updated 5 years ago
- Code for the paper "Phasic Policy Gradient"☆259Updated last year
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)☆484Updated 2 years ago
- Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)☆218Updated 2 years ago
- Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).☆163Updated 11 months ago
- A plotter for reinforcement learning (RL)☆221Updated 3 years ago
- A collection of multi agent environments based on OpenAI gym.☆595Updated 8 months ago
- An elegant PyTorch offline reinforcement learning library for researchers.☆310Updated 11 months ago
- Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019☆721Updated 2 years ago
- Repo containing code for multi-agent deep reinforcement learning (MADRL).☆686Updated last year
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆239Updated 4 years ago
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆444Updated last year
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆313Updated 6 months ago
- ☆238Updated last year
- Dream to Control: Learning Behaviors by Latent Imagination☆526Updated 3 years ago
- An engine to create high performance multi-agent grid world environments with hundreds or thousands of agents, along with a set of refere…☆190Updated 2 years ago