PyTorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for discrete Gym environments
☆29Sep 10, 2020Updated 5 years ago
Alternatives and similar repositories for MPO
Users who are interested in MPO are comparing it to the repositories listed below
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆80Nov 19, 2022Updated 3 years ago
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)☆20Nov 26, 2020Updated 5 years ago
- Prioritized Sequence Experience Replay☆10Aug 16, 2021Updated 4 years ago
- Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)☆16Oct 23, 2021Updated 4 years ago
- ☆13Jul 9, 2018Updated 7 years ago
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines☆51Jun 3, 2022Updated 3 years ago
- Deep Reinforcement Learning for Continuous Control in PyTorch☆105Dec 31, 2021Updated 4 years ago
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…☆20Mar 22, 2022Updated 3 years ago
- A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning☆12Aug 31, 2020Updated 5 years ago
- ☆14Jul 12, 2021Updated 4 years ago
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- ☆18Jan 4, 2021Updated 5 years ago
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.☆15Dec 8, 2020Updated 5 years ago
- Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.☆18Mar 16, 2022Updated 3 years ago
- Implementation of Behavioral Cloning from Observation☆16Nov 28, 2019Updated 6 years ago
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code☆18Mar 1, 2021Updated 5 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆83Jul 27, 2022Updated 3 years ago
- Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024☆24Apr 8, 2024Updated last year
- ☆19Nov 13, 2022Updated 3 years ago
- [ICLR 2024 oral] Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆29Mar 1, 2024Updated 2 years ago
- Bayesian active RL (BARL) and trajectory information planning (TIP)☆26Oct 11, 2022Updated 3 years ago
- Understanding RL vision Distill article☆25Mar 3, 2023Updated 2 years ago
- NeurIPS Reproducibility Challenge 2019☆20Feb 25, 2020Updated 6 years ago
- This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.☆20Jan 19, 2023Updated 3 years ago
- NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.☆25May 20, 2024Updated last year
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…☆32Sep 17, 2018Updated 7 years ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆28Mar 24, 2023Updated 2 years ago
- Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023)☆32Feb 6, 2023Updated 3 years ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆36Jan 24, 2026Updated last month
- Experiments to train transformer network to master reinforcement learning environments.☆32Mar 14, 2021Updated 4 years ago
- A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆32Oct 12, 2017Updated 8 years ago
- ☆25Jun 30, 2022Updated 3 years ago
- Compare Laguerre-based MPC and Traditional MPC for platoon of vehicles.☆13Feb 14, 2023Updated 3 years ago
- GCS for swarm☆37Nov 25, 2025Updated 3 months ago
- Transport code for plasma simulations☆12Feb 13, 2025Updated last year
- ☆80Oct 3, 2023Updated 2 years ago
- ☆29Oct 3, 2023Updated 2 years ago