facebookresearch / deep_bisim4control
Learning Invariant Representations for Reinforcement Learning without Reconstruction
☆147 Updated 3 years ago
Alternatives and similar repositories for deep_bisim4control:
Users who are interested in deep_bisim4control are comparing it to the libraries listed below.
- PyTorch implementation of Dreamer (Model-based Image RL Algorithm) ☆165 Updated 2 months ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆165 Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization ☆174 Updated 2 years ago
- Conservative Q Learning on top of SAC ☆127 Updated 2 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations" ☆158 Updated 3 years ago
- ☆112 Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆100 Updated 2 years ago
- A list of papers regarding generalization in (deep) reinforcement learning ☆151 Updated last year
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning ☆125 Updated 3 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite ☆211 Updated 10 months ago
- ☆53 Updated last year
- ☆193 Updated last year
- DMControl Generalization Benchmark ☆167 Updated last year
- PyTorch version of Dreamer, which follows the original TF v2 code. ☆122 Updated 3 years ago
- ☆52 Updated 4 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21) ☆75 Updated 2 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE ☆55 Updated last week
- PyTorch implementation of OpenAI's Procgen PPO baseline, built from scratch. ☆30 Updated 4 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder (SAC+AE) ☆239 Updated 4 years ago
- Author's PyTorch implementation of the Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆160 Updated 4 months ago
- PyTorch implementation of GAIL and AIRL based on PPO. ☆210 Updated 4 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019) ☆69 Updated last year
- Official codebase for "B-Pref: Benchmarking Preference-Based Reinforcement Learning"; contains scripts to reproduce experiments. ☆119 Updated 3 years ago
- Code for "Multi-task Reinforcement Learning with Soft Modularization" ☆119 Updated 4 years ago
- ☆46 Updated 2 years ago
- Model-Based Offline Reinforcement Learning ☆48 Updated 4 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021] ☆47 Updated 2 years ago
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning ☆63 Updated 4 years ago
- ☆66 Updated 4 years ago
- ☆48 Updated last year