joonleesky / train-procgen-pytorch
Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
☆31Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for train-procgen-pytorch
- ☆47Updated last year
- ☆54Updated 8 months ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆99Updated 2 years ago
- Simple maze environments using mujoco-py☆52Updated 10 months ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆95Updated 5 months ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆105Updated 2 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆71Updated 2 years ago
- ☆52Updated last year
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆18Updated 10 months ago
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆60Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆58Updated last year
- DMControl Generalization Benchmark☆168Updated 10 months ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆76Updated last year
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆144Updated 3 years ago
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm)☆163Updated 2 years ago
- Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily ext…☆106Updated last year
- Code for "Learning to Reach Goals via Iterated Supervised Learning"☆76Updated 2 years ago
- ☆52Updated 4 years ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆114Updated 3 years ago
- ☆110Updated last year
- ☆85Updated 10 months ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning☆63Updated 4 years ago
- Code for FOCAL Paper Published at ICLR 2021☆49Updated 11 months ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆155Updated 2 years ago
- CORRO code☆34Updated 2 years ago
- Conservative Q Learning on top of SAC☆119Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆116Updated last year
- COOM: Benchmarking Continual Reinforcement Learning on Doom☆12Updated 9 months ago
- Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning☆28Updated 2 years ago