Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments
☆29Sep 10, 2020Updated 5 years ago
Alternatives and similar repositories for MPO
Users that are interested in MPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch Implementation of the Maximum a Posteriori Policy Optimisation☆84Nov 19, 2022Updated 3 years ago
- Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)☆20Nov 26, 2020Updated 5 years ago
- Prioritized Sequence Experience Replay☆10Aug 16, 2021Updated 4 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Nov 10, 2020Updated 5 years ago
- Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)☆16Oct 23, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆18Jan 4, 2021Updated 5 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆85Jul 27, 2022Updated 3 years ago
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code☆19Mar 1, 2021Updated 5 years ago
- ☆13Jul 9, 2018Updated 7 years ago
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…☆20Mar 22, 2022Updated 4 years ago
- Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines☆52Jun 3, 2022Updated 3 years ago
- NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.☆25May 20, 2024Updated last year
- Laser cutting and design files for the stretchable circuits presented in Soft Arduinos.☆17Sep 9, 2024Updated last year
- ☆21May 29, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆14Jul 12, 2021Updated 4 years ago
- A minimal Unreal Engine project for developing and testing UnrealCV☆17Nov 8, 2018Updated 7 years ago
- A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.☆14Dec 8, 2020Updated 5 years ago
- Tabula Rasa Tic-Tac-Toe☆10Jan 3, 2019Updated 7 years ago
- Implementation of Behavioral Cloning from Observationmentation☆16Nov 28, 2019Updated 6 years ago
- Understanding RL vision Distill article☆25Mar 3, 2023Updated 3 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- ☆19Nov 13, 2022Updated 3 years ago
- ☆11Aug 7, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Explore and Control with Adversarial Surprise☆10Jul 20, 2021Updated 4 years ago
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.☆18Mar 16, 2022Updated 4 years ago
- ☆80Oct 3, 2023Updated 2 years ago
- This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.☆20Jan 19, 2023Updated 3 years ago
- Density Constrained Reinforcement Learning☆12Mar 24, 2023Updated 3 years ago
- An open source reinforcement learning codebase with a variety of intrinsic exploration methods implemented in PyTorch.☆11Feb 6, 2023Updated 3 years ago
- [ICLR 2024 oral] Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆30Mar 1, 2024Updated 2 years ago
- ☆14Oct 11, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Learning Backtracking Models, ICLR'19☆10Feb 2, 2018Updated 8 years ago
- Bayesian active RL (BARL) and trajectory information planning (TIP)☆26Oct 11, 2022Updated 3 years ago
- A minimal(ish) reinforcement learning library that aggregates reliable implementations.☆27Jun 2, 2025Updated 10 months ago
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Mar 14, 2021Updated 5 years ago
- Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)☆15Jan 19, 2021Updated 5 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 3 years ago