joonleesky / train-procgen-pytorchLinks
Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.
☆30Updated 5 years ago
Alternatives and similar repositories for train-procgen-pytorch
Users that are interested in train-procgen-pytorch are comparing it to the libraries listed below
Sorting:
- DMControl Generalization Benchmark☆181Updated last year
- Representation Learning for RL☆128Updated 2 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆225Updated last year
- Simple maze environments using mujoco-py☆57Updated last year
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆152Updated 4 years ago
- Deep Hierarchical Planning from Pixels☆110Updated 2 years ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆22Updated last year
- ☆202Updated 2 years ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.☆132Updated 4 years ago
- PyTorch implementation of DreamerV2 model-based RL algorithm☆234Updated 2 years ago
- Pytorch version of Dreamer, which follows the original TF v2 codes.☆134Updated 3 years ago
- ☆358Updated 3 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆118Updated 3 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆80Updated 3 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆181Updated 3 years ago
- ☆108Updated last year
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code☆18Updated 4 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆103Updated 3 years ago
- ☆114Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆176Updated last year
- Conservative Q Learning on top of SAC☆132Updated 3 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆190Updated 3 years ago
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆112Updated last year
- Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily ext…☆129Updated 2 years ago
- ☆52Updated 5 years ago
- Learning Laplacian Representations in Reinforcement Learning☆18Updated 4 years ago
- ☆58Updated 2 years ago
- ☆48Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated 2 years ago
- ☆54Updated last year