Improbable-AI/eipo

Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization

☆83

Alternatives and similar repositories for eipo

Users interested in eipo are comparing it to the repositories listed below.

SAIC-MONTREAL / hyperzero
Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning"
☆24 · Apr 26, 2023 · Updated 3 years ago
frt03 / jax_dt
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18 · Aug 8, 2022 · Updated 3 years ago
holarissun / RewardShifting
Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
☆29 · Oct 29, 2023 · Updated 2 years ago
danijar / crafter-baselines
Docker containers of baseline agents for the Crafter environment
☆30 · Dec 14, 2021 · Updated 4 years ago
51616 / marl-lipo
Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)
☆19 · May 10, 2024 · Updated 2 years ago
notmahi / disk
PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…
☆21 · Mar 22, 2022 · Updated 4 years ago
twitter-research / hyperbolic-rl
☆60 · Sep 22, 2022 · Updated 3 years ago
max7born / decision-lstm
Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…
☆28 · Mar 24, 2023 · Updated 3 years ago
TakuyaHiraoka / Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning
Source files to replicate experiments in my ICLR 2022 paper.
☆74 · Jul 17, 2025 · Updated last year
Baichenjia / Contrastive-UCB
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
☆12 · Jun 16, 2022 · Updated 4 years ago
robbycostales / HAL
Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" (ICLR 2022)
☆14 · Mar 14, 2022 · Updated 4 years ago
facebookresearch / cascade
Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).
☆30 · Oct 25, 2022 · Updated 3 years ago
tdmpc2 / tdmpc2-eval
Evaluation of TD-MPC2.
☆21 · Jan 21, 2024 · Updated 2 years ago
clvrai / skill-chaining
Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization (CoRL 2021)
☆38 · May 3, 2022 · Updated 4 years ago
kenjyoung / dreamerv2_JAX
An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.
☆18 · Jan 16, 2023 · Updated 3 years ago
ucl-dark / pax
Scalable Opponent Shaping Experiments in JAX
☆27 · Apr 13, 2024 · Updated 2 years ago
PKU-MARL / TRPO-PPO-in-MARL
☆16 · May 5, 2022 · Updated 4 years ago
garrett4wade / revisiting_marl
Official codebase for paper "Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning" (ICML22)
☆23 · Jul 16, 2022 · Updated 4 years ago
wisnunugroho21 / reinforcement_learning_v_mpo
Deep Reinforcement Learning by using an on-policy adaptation of Maximum a Posteriori Policy Optimization (MPO)
☆16 · Oct 23, 2021 · Updated 4 years ago
google-deepmind / csuite
☆47 · Sep 24, 2024 · Updated last year
fusion-ml / trajectory-information-rl
Bayesian active RL (BARL) and trajectory information planning (TIP)
☆26 · Oct 11, 2022 · Updated 3 years ago
penn-pal-lab / peg
Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.
☆83 · May 13, 2024 · Updated 2 years ago
sail-sg / optim4rl
Optim4RL is a Jax framework of learning to optimize for reinforcement learning.
☆28 · Nov 27, 2024 · Updated last year
subho406 / Recurrent-PPO-Jax
Implementation of Proximal Policy Optimization in Jax+Flax
☆21 · May 18, 2023 · Updated 3 years ago
danijar / director
Deep Hierarchical Planning from Pixels
☆122 · Dec 21, 2022 · Updated 3 years ago
uoe-agents / CMID
☆13 · Apr 25, 2024 · Updated 2 years ago
facebookresearch / dcd
Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.
☆141 · Aug 20, 2024 · Updated last year
si0wang / COPlanner
☆23 · Apr 2, 2024 · Updated 2 years ago
sahandrez / homomorphic_policy_gradient
Author's PyTorch Implementation of Deep Homomorphic Policy Gradient (DHPG) - NeurIPS 2022 and JMLR 2024
☆24 · Apr 8, 2024 · Updated 2 years ago
Shanghai-Digital-Brain-Laboratory / DB-Football
A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.
☆118 · Jan 16, 2024 · Updated 2 years ago
facebookresearch / drqv2
DrQ-v2: Improved Data-Augmented Reinforcement Learning
☆437 · May 31, 2022 · Updated 4 years ago
tseyde / decqn
☆35 · Jan 4, 2023 · Updated 3 years ago
denisyarats / exorl
ExORL: Exploratory Data for Offline Reinforcement Learning
☆138 · Feb 8, 2022 · Updated 4 years ago
sash-a / CleanRL.jl
Simple single file implementations of Reinforcement Learning algorithms in Julia
☆24 · Feb 15, 2025 · Updated last year
rll-research / cic
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
☆88 · Jul 27, 2022 · Updated 3 years ago
Improbable-AI / curiosity_baselines
An open source reinforcement learning codebase with a variety of intrinsic exploration methods implemented in PyTorch.
☆11 · Feb 6, 2023 · Updated 3 years ago
zhaoyi11 / tcrl
☆26 · Jan 26, 2024 · Updated 2 years ago
astanic / crafter-ood
☆19 · Nov 25, 2022 · Updated 3 years ago
MarcoMeter / recurrent-ppo-truncated-bptt
Baseline implementation of recurrent PPO using truncated BPTT
☆161 · Apr 28, 2024 · Updated 2 years ago