acyclics/MPO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/acyclics/MPO)

acyclics / MPO

Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments

☆29

Alternatives and similar repositories for MPO

Users that are interested in MPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

daisatojp / mpo
View on GitHub
PyTorch Implementation of the Maximum a Posteriori Policy Optimisation
☆84Nov 19, 2022Updated 3 years ago
tgangwani / SelfImitationDiverse
View on GitHub
Tensorflow code for "Learning Self-Imitating Diverse Policies" (ICLR 2019)
☆20Nov 26, 2020Updated 5 years ago
marcbrittain / Prioritized-Sequence-Experience-Replay
View on GitHub
Prioritized Sequence Experience Replay
☆10Aug 16, 2021Updated 4 years ago
YYCAAA / V-MPO_Lunarlander
View on GitHub
Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238
☆48Nov 10, 2020Updated 5 years ago
yardenas / panda-rl-kit
View on GitHub
Deploy RL on your Real-World Franka Emika Panda
☆15Feb 22, 2026Updated 4 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
DRL-CASIA / Deep-Reinforcement-Learning
View on GitHub
☆18Jan 4, 2021Updated 5 years ago
rll-research / cic
View on GitHub
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
☆88Jul 27, 2022Updated 3 years ago
Aladoro / domain-robust-visual-il
View on GitHub
Domain-Robust Visual Imitation Learning with Mutual Information Constraints code
☆19Mar 1, 2021Updated 5 years ago
kavosh8 / Lip
View on GitHub
☆13Jul 9, 2018Updated 8 years ago
notmahi / disk
View on GitHub
PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…
☆21Mar 22, 2022Updated 4 years ago
jakegrigsby / deep_control
View on GitHub
Deep Reinforcement Learning for Continuous Control in PyTorch
☆106Dec 31, 2021Updated 4 years ago
brett-daley / dqn-lambda
View on GitHub
NeurIPS 2019: DQN(λ) = Deep Q-Network + λ-returns.
☆25May 20, 2024Updated 2 years ago
liyheng / FOP
View on GitHub
☆14Jul 12, 2021Updated 5 years ago
unrealcv / playground
View on GitHub
A minimal Unreal Engine project for developing and testing UnrealCV
☆17Nov 8, 2018Updated 7 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
mjanschek / pytorch_seed_rl
View on GitHub
A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.
☆14Dec 8, 2020Updated 5 years ago
khushhallchandra / pytorch-rl
View on GitHub
Pytorch Implementation of RL algorithms
☆15Feb 26, 2018Updated 8 years ago
shkr / bayesian_regression
View on GitHub
Bayesian Regression Models using pymc3
☆11Feb 4, 2017Updated 9 years ago
jerrylin1121 / BCO
View on GitHub
Implementation of Behavioral Cloning from Observationmentation
☆16Nov 28, 2019Updated 6 years ago
distillpub / post--understanding-rl-vision
View on GitHub
Understanding RL vision Distill article
☆25Mar 3, 2023Updated 3 years ago
frt03 / jax_dt
View on GitHub
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18Aug 8, 2022Updated 3 years ago
Neo-X / SMiRL_Code
View on GitHub
☆20Nov 13, 2022Updated 3 years ago
clvrai / agile
View on GitHub
Official implementation of "Know Your Action Set: Learning Action Relations for Reinforcement Learning", Jain et al., ICLR 2022.
☆18Mar 16, 2022Updated 4 years ago
ArnaudFickinger / adversarial-surprise
View on GitHub
Explore and Control with Adversarial Surprise
☆10Jul 20, 2021Updated 5 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
dnishio / DSAC
View on GitHub
The implementation of Discriminator Soft Actor Critic
☆15Jan 25, 2020Updated 6 years ago
facebookresearch / hsd3
View on GitHub
Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines
☆52Jun 3, 2022Updated 4 years ago
jsikyoon / bmaml_rl
View on GitHub
This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.
☆20Jan 19, 2023Updated 3 years ago
MIT-REALM / dcrl
View on GitHub
Density Constrained Reinforcement Learning
☆12Mar 24, 2023Updated 3 years ago
Improbable-AI / curiosity_baselines
View on GitHub
An open source reinforcement learning codebase with a variety of intrinsic exploration methods implemented in PyTorch.
☆11Feb 6, 2023Updated 3 years ago
resibots / chatzilygeroudis_2018_rte
View on GitHub
Code for the Reset-free Trial and Error learning paper (RTE) experiments
☆10Jan 3, 2018Updated 8 years ago
anirudh9119 / rl_adversarial
View on GitHub
Learning Backtracking Models, ICLR'19
☆10Feb 2, 2018Updated 8 years ago
PKU-RL / PTGM
View on GitHub
[ICLR 2024 oral] Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning
☆30Mar 1, 2024Updated 2 years ago
yashbonde / Transformer-RL
View on GitHub
Experiments to train transformer network to master reinforcement learning environments.
☆32Mar 14, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
tmoer / a0c
View on GitHub
Code for the paper Alpha Zero in Continuous Action Space (A0C) (https://arxiv.org/pdf/1805.09613.pdf)
☆15Jan 19, 2021Updated 5 years ago
zafstojano / policy-gradients
View on GitHub
A minimal hackable implementation of policy gradient methods (GRPO, PPO, REINFORCE)
☆16Feb 20, 2026Updated 5 months ago
chscheller / minerl_agent
View on GitHub
3rd placed submission to the NeurIPS MineRL competition 2019
☆10Mar 24, 2023Updated 3 years ago
abhayraw1 / planet-torch
View on GitHub
A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning
☆13Aug 31, 2020Updated 5 years ago
braraki / logical-options-framework
View on GitHub
☆10Jun 7, 2021Updated 5 years ago
seohongpark / PMA
View on GitHub
Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023)
☆33Feb 6, 2023Updated 3 years ago
EnergyQuantResearch / DF-SRL
View on GitHub
DistFlow Safe Reinforcement Learning Algorithm for Voltage Magnitude Regulation in Distribution Networks
☆14Jul 9, 2025Updated last year