Itomigna2 / Muesli-lunarlander
Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)
☆16Updated last year
Alternatives and similar repositories for Muesli-lunarlander:
Users that are interested in Muesli-lunarlander are comparing it to the libraries listed below
- ☆17Updated 3 years ago
- MiniZero: An AlphaZero and MuZero Training Framework☆88Updated last month
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆88Updated 2 weeks ago
- Pytorch Implementation of Stochastic MuZero for gym environment. This algorithm is capable of supporting a wide range of action and obser…☆64Updated last year
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆38Updated 2 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆86Updated 2 years ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆27Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆111Updated 8 months ago
- ☆74Updated 5 months ago
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆74Updated 2 years ago
- An implementation of PPO in Pytorch☆72Updated 2 months ago
- DEIR: Efficient and Robust Exploration through Discriminative-Model-Based Episodic Intrinsic Rewards☆24Updated 11 months ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆97Updated 5 months ago
- Evaluating long-term memory of reinforcement learning algorithms☆141Updated last year
- ☆44Updated last year
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 2 years ago
- A pytorch implementation of Dreamer☆20Updated 2 years ago
- A Simplified Pytorch Version of the Dreamer Algorithm☆126Updated last year
- A project that provides help for using DeepMind's mctx on gym-style environments.☆58Updated 5 months ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆175Updated 10 months ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆100Updated 2 years ago
- Experimentation with Regularized Nash Dynamics on a GPU accelerated game☆47Updated 2 years ago
- ☆41Updated 9 months ago
- Code for the paper "Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference"☆42Updated 9 months ago
- Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023☆46Updated last year
- Code for the papers Hypernetworks in Meta-Reinforcement Learning (Beck et al., 2022) and Recurrent Hypernetworks are Surprisingly Strong …☆12Updated 8 months ago
- Baselines for gymnax 🤖☆66Updated 2 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆100Updated last year
- ☆15Updated 2 years ago
- Multi-Agent Deep Reinforcement Learning by using Asynchronous & Impala Proximal Policy Optimization in Pytorch with some explanation☆36Updated 4 years ago