microsoft / EPPOLinks
An implementation of effective policy ensemble.
☆16Updated 2 years ago
Alternatives and similar repositories for EPPO
Users that are interested in EPPO are comparing it to the libraries listed below
Sorting:
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …☆72Updated 3 years ago
- Reinforcement Learning via Latent State Decoding☆29Updated 2 years ago
- MetaGenRL, a novel meta reinforcement learning algorithm. Unlike prior work, MetaGenRL can generalize to new environments that are entire…☆69Updated 5 years ago
- Author's PyTorch implementation of paper "Provably Good Batch Reinforcement Learning Without Great Exploration"☆12Updated 5 years ago
- Code for the paper "Meta-Q-Learning"( ICLR 2020)☆106Updated 3 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆163Updated 5 years ago
- Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.☆28Updated 3 years ago
- ☆40Updated 4 years ago
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Updated 6 years ago
- PyTorch implementation of "Model-based Reinforcement Learning for Semi-Markov Decision Processes with Neural ODEs", NeurIPS 2020☆46Updated 5 years ago
- Model-Based Offline Reinforcement Learning☆52Updated 5 years ago
- ☆54Updated last year
- ☆26Updated 3 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Updated 5 years ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas.☆97Updated 7 months ago
- ☆19Updated 2 years ago
- ☆18Updated 3 years ago
- ☆26Updated 2 years ago
- Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"☆23Updated 2 years ago
- Implementation of our paper "Meta Reinforcement Learning with Task Embedding and Shared Policy"☆35Updated 6 years ago
- Source code for the Self-Paced Deep Reinforcement Learning Experiments☆32Updated 2 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"☆20Updated 4 years ago
- ☆89Updated last year
- Code for Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation @ NeurIPS 2021☆13Updated 4 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Updated 2 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆35Updated 3 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆28Updated 3 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆22Updated last year
- Taming MAML: efficient unbiased meta-reinforcement learning☆30Updated 3 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆191Updated 3 years ago