RyanNavillus / PPO-v3Links
Adding Dreamer-v3's implementation tricks to CleanRL's PPO
☆12Updated 2 years ago
Alternatives and similar repositories for PPO-v3
Users that are interested in PPO-v3 are comparing it to the libraries listed below
Sorting:
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆27Updated 2 years ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Updated last year
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆24Updated 2 months ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆11Updated 2 years ago
- Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"☆51Updated 3 weeks ago
- Flax Implementation of DreamerV3 on Crafter☆16Updated 3 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- Accelerated replay buffers in JAX☆41Updated 2 years ago
- ☆31Updated last year
- Official Implementation of `An Optimisation Framework for Unsupervised Environment Design` from RLC 2025☆11Updated 3 weeks ago
- A collection of matrix games in JAX☆11Updated 6 months ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆57Updated last year
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆28Updated 11 months ago
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆41Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- Implementation of Proximal Policy Optimization in Jax+Flax☆19Updated 2 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆88Updated 2 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆18Updated 7 months ago
- Benchmarked implementations of Offline RL Algorithms.☆73Updated 3 months ago
- Code for the papers Hypernetworks in Meta-Reinforcement Learning (Beck et al., 2022) and Recurrent Hypernetworks are Surprisingly Strong …☆14Updated 10 months ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated 10 months ago
- Deep Reinforcement Learning Framework done with PyTorch☆36Updated 3 months ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆17Updated 2 years ago
- Learning diverse options through the Laplacian representation.☆23Updated last year
- ☆19Updated 2 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆114Updated 3 years ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆53Updated last month
- DreamerV3 implementation of Curious Replay, a method for prioritizing experience replay that is tailored to model-based reinforcement lea…☆36Updated last year
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆27Updated 2 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆17Updated 2 years ago