Div99 / IQ-Learn
(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation
☆209Updated 2 years ago
Alternatives and similar repositories for IQ-Learn:
Users who are interested in IQ-Learn are comparing it to the libraries listed below
- PyTorch implementation of Dreamer (Model-based Image RL Algorithm)☆165Updated 2 years ago
- ☆191Updated last year
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆339Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆237Updated 4 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆310Updated 4 months ago
- PyTorch implementation of DreamerV2 model-based RL algorithm☆214Updated last year
- Code for MOPO: Model-based Offline Policy Optimization☆173Updated 2 years ago
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.☆291Updated last year
- OpenAI Gym wrapper for the DeepMind Control Suite☆210Updated 7 months ago
- PyTorch implementation of GAIL and AIRL based on PPO.☆204Updated 4 years ago
- PyTorch version of Dreamer, which follows the original TF v2 code.☆121Updated 2 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆159Updated 2 years ago
- ☆335Updated 2 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆146Updated 3 years ago
- PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments☆303Updated 3 years ago
- Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.☆339Updated last year
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆154Updated 2 months ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆121Updated 5 months ago
- Official implementation of "Accelerating Reinforcement Learning with Learned Skill Priors", Pertsch et al., CoRL 2020☆199Updated last year
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆193Updated 2 years ago
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)☆186Updated last year
- Lightweight multi-agent gridworld Gym environment☆199Updated last year
- Code for 'Dynamics-Aware Unsupervised Discovery of Skills' (DADS). Enables skill discovery without supervision, which can be combined wit…☆188Updated 3 years ago
- Multi Task RL Baselines☆231Updated 3 years ago
- Unofficial PyTorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆187Updated 2 years ago
- Level-based Foraging (LBF): A multi-agent environment for RL☆169Updated 4 months ago
- Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning", containing scripts to reproduce experiments.☆118Updated 3 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆132Updated 8 months ago
- Implementation of Trajectory Transformer with attention caching and batched beam search☆108Updated last year
- Conservative Q-Learning on top of SAC☆122Updated 2 years ago