joonleesky / train-procgen-pytorch
Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
☆31Updated 4 years ago
Alternatives and similar repositories for train-procgen-pytorch:
Users that are interested in train-procgen-pytorch are comparing it to the libraries listed below
- ☆54Updated 11 months ago
- ☆47Updated last year
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆102Updated 2 years ago
- Simple maze environments using mujoco-py☆54Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆98Updated 8 months ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆19Updated last year
- COOM: Benchmarking Continual Reinforcement Learning on Doom☆14Updated last week
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆65Updated last year
- ExORL: Exploratory Data for Offline Reinforcement Learning☆108Updated 3 years ago
- ☆55Updated 2 years ago
- ☆52Updated 4 years ago
- Code for FOCAL Paper Published at ICLR 2021☆51Updated last year
- Deep Hierarchical Planning from Pixels☆94Updated 2 years ago
- Learning Laplacian Representations in Reinforcement Learning☆17Updated 4 years ago
- ☆41Updated 3 years ago
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆44Updated 3 years ago
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆76Updated 2 months ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated last year
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆146Updated 3 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆75Updated 2 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆52Updated 3 years ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆118Updated 3 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆44Updated 4 years ago
- Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily ext…☆111Updated last year
- ☆111Updated last year
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆22Updated last year
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆68Updated 3 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆85Updated 3 years ago
- ☆26Updated last year
- ☆46Updated 2 years ago