Improbable-AI / eipo
Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization
☆81Updated 2 years ago
Alternatives and similar repositories for eipo
Users who are interested in eipo are comparing it to the libraries listed below:
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆171Updated 9 months ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆51Updated 3 months ago
- Code for the paper: Hierarchical Reinforcement Learning With Timed Subgoals, published at NeurIPS 2021☆34Updated 3 years ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.☆70Updated 2 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO☆179Updated 3 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016☆132Updated last year
- Safe Multi-Agent MuJoCo benchmark for safe multi-agent reinforcement learning research.☆65Updated last year
- [ECCV2022] [T-PAMI] StARformer: Transformer with State-Action-Reward Representations.☆94Updated 2 years ago
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆46Updated 3 years ago
- A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data…☆38Updated last year
- Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.☆72Updated last year
- Implementation of Multi-Game Decision Transformers in PyTorch☆47Updated 2 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆70Updated last month
- This repo relates to the survey paper <Goal-Conditioned Reinforcement Learning: Problems and Solutions>. We collect widely used benchmar…☆142Updated 2 years ago
- MATE: the Multi-Agent Tracking Environment.☆47Updated 2 years ago
- Advantage weighted Actor Critic for Offline RL☆50Updated 3 years ago
- PyTorch implementation of our paper Reinforcement Learning with Random Delays (ICLR 2020)☆42Updated 3 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards☆32Updated 2 years ago
- Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).☆87Updated last year
- behavior cloning from observation☆36Updated 4 years ago
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆187Updated last year
- 🔥 Datasets and env wrappers for offline safe reinforcement learning☆98Updated 11 months ago
- ☆43Updated 2 years ago
- The official repository of "Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆27Updated 3 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆184Updated 3 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆152Updated last year
- Implementation of HIRO (Data-Efficient Hierarchical Reinforcement Learning)☆111Updated 4 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23)☆60Updated 2 years ago
- A Modular Library for Off-Policy Reinforcement Learning with a focus on SafeRL and distributed computing☆134Updated last month
- PyTorch Implementation of FeUdal Networks for Hierarchical Reinforcement Learning (FuNs), Vezhnevets et al. 2017.☆40Updated 5 years ago