daisatojp/mpo

daisatojp / mpo

PyTorch Implementation of the Maximum a Posteriori Policy Optimisation

☆84

Alternatives and similar repositories for mpo

Users that are interested in mpo are comparing it to the libraries listed below.

acyclics / MPO
Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments
☆29 · Sep 10, 2020 · Updated 5 years ago
YYCAAA / V-MPO_Lunarlander
Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238
☆48 · Nov 10, 2020 · Updated 5 years ago
theogruner / rl_pro_telu
☆23 · Jun 8, 2021 · Updated 5 years ago
frt03 / jax_dt
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18 · Aug 8, 2022 · Updated 3 years ago
wisnunugroho21 / reinforcement_learning_v_mpo
Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)
☆16 · Oct 23, 2021 · Updated 4 years ago
abhayraw1 / planet-torch
A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning
☆13 · Aug 31, 2020 · Updated 5 years ago
jsikyoon / V-MPO_torch
V-MPO torch version with DMLab30 and GTrXL
☆13 · Mar 1, 2021 · Updated 5 years ago
twitter-research / hyperbolic-rl
☆60 · Sep 22, 2022 · Updated 3 years ago
real-stanford / ASPiRe
[NeurIPS 2022] ASPiRe: Adaptive Skill Priors for Reinforcement Learning
☆13 · Oct 19, 2022 · Updated 3 years ago
OrionZou / mcppoElegantRLforCarla
☆10 · Aug 16, 2022 · Updated 3 years ago
TakuyaHiraoka / Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning
Source files to replicate experiments in my ICLR 2022 paper.
☆74 · Jul 17, 2025 · Updated last year
uoe-agents / CMID
☆13 · Apr 25, 2024 · Updated 2 years ago
dunnolab / NinA
Official implementation of "NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows"
☆17 · Sep 22, 2025 · Updated 10 months ago
hr0nix / omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆44 · Sep 19, 2022 · Updated 3 years ago
jimimvp / torch_rl
Reinforcement learning library for PyTorch.
☆11 · Jun 15, 2018 · Updated 8 years ago
Michael-Beukman / RobocupGym
Reinforcement Learning inside a 3D soccer simulation
☆37 · Sep 15, 2024 · Updated last year
fusion-ml / trajectory-information-rl
Bayesian active RL (BARL) and trajectory information planning (TIP)
☆26 · Oct 11, 2022 · Updated 3 years ago
SaminYeasar / Off_Policy_Adversarial_Inverse_Reinforcement_Learning
Implementation of Off Policy Adversarial Inverse Reinforcement Learning
☆23 · Oct 9, 2020 · Updated 5 years ago
YuriCat / MuesliJupyterExample
☆18 · Nov 4, 2021 · Updated 4 years ago
0xangelo / gym-cartpole-swingup
A simple, continuous-control environment for OpenAI Gym
☆23 · Jan 1, 2023 · Updated 3 years ago
kenjyoung / dreamerv2_JAX
An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.
☆18 · Jan 16, 2023 · Updated 3 years ago
ikostrikov / dmcgym
☆23 · Aug 19, 2022 · Updated 3 years ago
51616 / marl-lipo
Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)
☆19 · May 10, 2024 · Updated 2 years ago
sparisi / cbet
Change-Based Exploration Transfer
☆35 · Apr 24, 2022 · Updated 4 years ago
frt03 / generalized_dt
Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)
☆70 · Aug 8, 2022 · Updated 3 years ago
Kaixhin / EC
Episodic Control
☆22 · Sep 20, 2022 · Updated 3 years ago
facebookresearch / hsd3
Code for "Hierarchical Skills for Efficient Exploration" HSD-3 Algorithm and Baselines
☆52 · Jun 3, 2022 · Updated 4 years ago
penn-pal-lab / peg
Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.
☆83 · May 13, 2024 · Updated 2 years ago
dnishio / DSAC
The implementation of Discriminator Soft Actor Critic
☆15 · Jan 25, 2020 · Updated 6 years ago
denisyarats / dmc2gym
OpenAI Gym wrapper for the DeepMind Control Suite
☆229 · May 19, 2024 · Updated 2 years ago
PKU-MARL / TRPO-PPO-in-MARL
☆16 · May 5, 2022 · Updated 4 years ago
SapanaChaudhary / PyTorch-CPO
PyTorch implementation of Constrained Policy Optimization
☆58 · Oct 19, 2021 · Updated 4 years ago
johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆76 · Jun 9, 2021 · Updated 5 years ago
DramaCow / jaxued
☆98 · Jan 21, 2026 · Updated 6 months ago
subho406 / Recurrent-PPO-Jax
Implementation of Proximal Policy Optimization in Jax+Flax
☆21 · May 18, 2023 · Updated 3 years ago
Itomigna2 / Muesli-lunarlander
Muesli RL algorithm implementation (PyTorch) (LunarLander-v2)
☆20 · Mar 18, 2024 · Updated 2 years ago
tdmpc2 / tdmpc2-eval
Evaluation of TD-MPC2.
☆21 · Jan 21, 2024 · Updated 2 years ago
TrentBrick / RewardConditionedUDRL
Open source code combining implementations of Upside Down Reinforcement Learning and Reward Conditioned Policies
☆19 · Mar 10, 2021 · Updated 5 years ago
uoe-agents / TED
Official repository for "Temporal Disentanglement of Representations for Improved Generalisation in Reinforcement Learning".
☆13 · Jan 25, 2023 · Updated 3 years ago