AboudyKreidieh / h-baselinesLinks

A repository of high-performing hierarchical reinforcement learning models and algorithms.

☆316

Alternatives and similar repositories for h-baselines

Users that are interested in h-baselines are comparing it to the libraries listed below

Sorting:

nikhilbarhate99 / Hierarchical-Actor-Critic-HAC-PyTorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
☆318Updated 3 years ago
andrew-j-levy / Hierarchical-Actor-Critc-HAC-
This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.
☆259Updated 5 years ago
toshikwa / sac-discrete.pytorch
PyTorch implementation of SAC-Discrete.
☆307Updated last year
BY571 / Soft-Actor-Critic-and-Extensions
PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…
☆289Updated 4 years ago
lweitkamp / option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
☆130Updated last year
jannerm / mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
☆504Updated 2 years ago
adik993 / ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆143Updated 6 years ago
sfujim / TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
☆366Updated 3 years ago
schroederdewitt / multiagent_mujoco
Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.
☆357Updated 2 years ago
schatty / oprl
A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing
☆134Updated 2 weeks ago
TonghanWang / ROMA
Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)
☆160Updated 2 years ago
toshikwa / fqf-iqn-qrdqn.pytorch
PyTorch implementation of FQF, IQN and QR-DQN.
☆182Updated last year
ArnaudFickinger / gym-multigrid
Lightweight multi-agent gridworld Gym environment
☆209Updated last year
mlii / mfrl
Mean Field Multi-Agent Reinforcement Learning
☆397Updated 5 years ago
openai / safety-starter-agents
Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.
☆435Updated 2 years ago
denisyarats / pytorch_sac
PyTorch implementation of Soft Actor-Critic (SAC)
☆555Updated 3 years ago
aviralkumar2907 / CQL
Code for conservative Q-learning
☆450Updated 3 years ago
ac-93 / soft-actor-critic
Modified versions of the SAC algorithm from spinningup for discrete action spaces and image observations.
☆96Updated 5 years ago
lcswillems / torch-ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
☆203Updated 2 years ago
ShawK91 / Evolutionary-Reinforcement-Learning
Codebase for Evolutionary Reinforcement Learning (ERL) from the paper "Evolution-Guided Policy Gradients in Reinforcement Learning" publi…
☆236Updated 4 years ago
semitable / lb-foraging
Level-based Foraging (LBF): A multi-agent environment for RL
☆187Updated 10 months ago
TianhongDai / hindsight-experience-replay
This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.
☆433Updated 3 years ago
BY571 / CQL
PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…
☆137Updated last year
Sonkyunghwan / QTRAN
There will be updates later
☆84Updated 6 years ago
ermongroup / MA-AIRL
Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.
☆210Updated 6 years ago
Farama-Foundation / MAgent
An engine to create high performance multi-agent grid world environments with hundreds or thousands of agents, along with a set of refere…
☆194Updated 2 years ago
cyoon1729 / Policy-Gradient-Methods
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
☆99Updated 6 years ago
qian18long / epciclr2020
☆121Updated 2 years ago
toshikwa / gail-airl-ppo.pytorch
PyTorch implementation of GAIL and AIRL based on PPO.
☆221Updated 4 years ago
watchernyu / REDQ
Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆171Updated 8 months ago