brett-daley / gym-classics
Classic environments for reinforcement learning and dynamic programming, implemented in OpenAI Gym and Gymnasium.
☆19Updated last year
Related projects ⓘ
Alternatives and complementary repositories for gym-classics
- Simple maze environments using mujoco-py☆52Updated 10 months ago
- Benchmarking RL generalization in an interpretable way.☆131Updated 9 months ago
- Gym-like extensions for POMDP☆55Updated 3 years ago
- ☆189Updated last year
- Code for MOPO: Model-based Offline Policy Optimization☆171Updated 2 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆204Updated 5 months ago
- Inverse Reinforcement Learning via State Marginal Matching, CoRL 2020☆41Updated last year
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆154Updated 2 years ago
- ☆15Updated last year
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]☆48Updated 3 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method☆62Updated last year
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆151Updated 2 weeks ago
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm)☆162Updated 2 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆186Updated last year
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …☆69Updated last year
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"☆21Updated 4 years ago
- A curated list of awesome Model-based reinforcement learning resources☆90Updated 4 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆95Updated 5 months ago
- NeurIPS Reproducibility Challenge 2019☆20Updated 4 years ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆114Updated 3 years ago
- Conservative Q learning in Jax☆50Updated last year
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆72Updated last year
- Hindsight policy gradients☆43Updated 4 years ago
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆31Updated 4 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆98Updated 2 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆156Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆114Updated last year
- ☆16Updated 5 years ago
- Official codebase for LEAP: Planning with Goal Conditioned Policies☆50Updated 2 years ago
- ☆230Updated 2 years ago