AboudyKreidieh / h-baselines
A repository of high-performing hierarchical reinforcement learning models and algorithms.
☆277Updated last year
Related projects: ⓘ
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆286Updated 2 years ago
- PyTorch implementation of SAC-Discrete.☆273Updated last month
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆253Updated 4 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆475Updated last year
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆325Updated last year
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆311Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic (SAC)☆492Updated 2 years ago
- PyTorch implementation of Soft-Actor-Critic and Prioritized Experience Replay (PER) + Emphasizing Recent Experience (ERE) + Munchausen RL…☆265Updated 3 years ago
- Multi-Objective Reinforcement Learning☆246Updated 3 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆293Updated 3 weeks ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆113Updated last month
- Level-based Foraging (LBF): A multi-agent environment for RL☆152Updated this week
- Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.☆390Updated last year
- Mean Field Multi-Agent Reinforcement Learning☆374Updated 4 years ago
- Codebase for Evolutionary Reinforcement Learning (ERL) from the paper "Evolution-Guided Policy Gradients in Reinforcement Learning" publi…☆212Updated 4 years ago
- An extension of the PyMARL codebase that includes additional algorithms and environment support☆465Updated last month
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆191Updated last year
- ☆187Updated last year
- Code for conservative Q-learning☆393Updated 2 years ago
- Code for the paper "Phasic Policy Gradient"☆244Updated last year
- PyTorch implementation of FQF, IQN and QR-DQN.☆158Updated last month
- An engine to create high performance multi-agent grid world environments with hundreds or thousands of agents, along with a set of refere…☆191Updated last year
- multi-agent deep reinforcement learning for networked system control.☆371Updated 3 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆118Updated 4 months ago
- Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.☆195Updated 5 years ago
- ☆117Updated last year
- Constrained Policy Optimization☆305Updated 7 years ago
- Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019☆664Updated 2 years ago
- Gridworld for MARL experiments☆137Updated 3 years ago
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆168Updated last year