sfujim / BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
☆596Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for BCQ
- Code for conservative Q-learning☆408Updated 2 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆294Updated 2 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆479Updated last year
- PyTorch implementation of Soft Actor-Critic (SAC)☆512Updated 2 years ago
- Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019☆671Updated 2 years ago
- PyTorch Implementation of MADDPG (Lowe et. al. 2017)☆573Updated 4 years ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆302Updated 4 years ago
- PyTorch implementation of SAC-Discrete.☆284Updated 3 months ago
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)☆472Updated last year
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆326Updated 2 years ago
- Repo containing code for multi-agent deep reinforcement learning (MADRL).☆665Updated last year
- A repository of high-performing hierarchical reinforcement learning models and algorithms.☆280Updated last year
- PyTorch implementation of soft actor critic☆814Updated 3 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆253Updated 4 years ago
- Mean Field Multi-Agent Reinforcement Learning☆377Updated 4 years ago
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games☆533Updated last year
- PyTorch implementation of Trust Region Policy Optimization☆432Updated 6 years ago
- Inverse RL algorithms (APP, MaxEnt, GAIL, VAIL)☆706Updated 10 months ago
- A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)☆615Updated 6 years ago
- This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.☆398Updated 2 years ago
- A collection of reference environments for offline reinforcement learning☆1,334Updated this week
- PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.☆526Updated 6 years ago
- Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp…☆1,216Updated 11 months ago
- Hello, I pushed some python environments for Multi Agent Reinforcement Learning.☆668Updated 2 years ago
- Soft Actor-Critic☆999Updated 11 months ago
- Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.☆393Updated last year
- Constrained Policy Optimization☆305Updated 7 years ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆331Updated last year
- ☆387Updated 5 years ago
- A collection of multi agent environments based on OpenAI gym.☆570Updated 4 months ago