microsoft / EPPO
An implementation of effective policy ensemble.
☆16 Updated 2 years ago
Alternatives and similar repositories for EPPO
Users who are interested in EPPO are comparing it to the libraries listed below.
- Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration" ☆12 Updated 5 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire… ☆68 Updated 5 years ago
- PyTorch implementation of "Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs", NeurIPS 2020 ☆46 Updated 5 years ago
- ☆48 Updated 3 years ago
- Reinforcement Learning via Latent State Decoding ☆29 Updated 2 years ago
- Implementation of VALOR (Variational Option Discovery Algorithms) ☆10 Updated 6 years ago
- Model-Based Offline Reinforcement Learning ☆52 Updated 5 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction ☆163 Updated 5 years ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas. ☆97 Updated 7 months ago
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan … ☆72 Updated 3 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning ☆48 Updated 7 years ago
- CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning ☆240 Updated 3 years ago
- Code for Invariant Policy Optimization ☆14 Updated 5 years ago
- ☆18 Updated 3 years ago
- ☆86 Updated 4 years ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy" ☆35 Updated 6 years ago
- Taming MAML: efficient unbiased meta-reinforcement learning ☆30 Updated 3 years ago
- Code for the paper "Meta-Q-Learning" (ICLR 2020) ☆106 Updated 3 years ago
- Learning to Adapt in Dynamic, Real-World Environments through Meta-Reinforcement Learning ☆218 Updated 3 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆176 Updated last year
- ☆54 Updated last year
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration ☆28 Updated 3 years ago
- Python interface for accessing the near real-world offline reinforcement learning (NeoRL) benchmark datasets ☆133 Updated last year
- This repository contains the implementation for the paper - Exploration via Hierarchical Meta Reinforcement Learning. ☆62 Updated 6 years ago
- Code for MOPO: Model-based Offline Policy Optimization ☆191 Updated 3 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆32 Updated 4 years ago
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting ☆35 Updated 4 years ago
- ☆92 Updated 2 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm ☆44 Updated 7 years ago
- ☆26 Updated 2 years ago