Improbable-AI / eipo
Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization
☆80Updated 2 years ago
Alternatives and similar repositories for eipo:
Users that are interested in eipo are comparing it to the libraries listed below
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collects widely used benchmar…☆124Updated last year
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆39Updated 4 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆40Updated 2 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆48Updated 2 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆66Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆61Updated 10 months ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆52Updated 2 years ago
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆83Updated last year
- Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021☆33Updated 2 years ago
- Author's PyTorch implementation of TD7 for online and offline RL☆141Updated last year
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.☆95Updated last year
- Implement many Sparse Reward algorithms in Gym Fetch environment☆86Updated 4 years ago
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆106Updated 3 years ago
- ☆109Updated 2 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated 9 months ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆167Updated 3 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆163Updated 5 months ago
- Baseline implementation of recurrent PPO using truncated BPTT☆139Updated 11 months ago
- DSAC; Distributional Soft Actor-Critic☆125Updated 2 months ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆133Updated 8 months ago
- Codes accompanying the paper "Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning" (NeurIPS…☆73Updated 2 years ago
- Code for "Constrained Variational Policy Optimization for Safe Reinforcement Learning" (ICML 2022)☆75Updated last year
- Codes accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆52Updated 2 years ago
- [NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.☆85Updated 2 years ago
- Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)☆63Updated 5 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆177Updated 2 years ago
- Experiments to train transformer network to master reinforcement learning environments.☆32Updated 4 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆126Updated 8 months ago
- DecentralizedLearning☆24Updated 2 years ago
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆27Updated last year