microsoft / EPPO
An implementation of effective policy ensemble.
☆10Updated last year
Related projects ⓘ
Alternatives and complementary repositories for EPPO
- More efficient exploration for reinforcement learning in two-player, zero-sum game☆17Updated 3 months ago
- The collection of the research works about Automatic Reinforcement Learning in Microsoft Research Asia.☆48Updated last year
- Reinforcement Learning via Latent State Decoding☆30Updated last year
- ☆8Updated 2 years ago
- ☆12Updated 2 years ago
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …☆69Updated last year
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆13Updated 2 years ago
- ☆30Updated 3 months ago
- Logarithmic Reinforcement Learning☆26Updated last year
- Sandbox environment for generalizable agent research☆23Updated 2 years ago
- Official code for "RAMBO: Robust Adversarial Model-Based Offline RL", NeurIPS 2022☆25Updated last year
- Official repository for Paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022)☆35Updated last year
- Offline Risk-Averse Actor-Critic (O-RAAC). A model-free RL algorithm for risk-averse RL in a fully offline setting☆33Updated 3 years ago
- ☆40Updated 3 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆105Updated 2 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆24Updated 3 years ago
- Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)☆18Updated 3 years ago
- Source code for the Self-Paced Deep Reinforcement Learning Experiments☆30Updated last year
- ☆29Updated last year
- Implementation of VALOR (Variational Option Discovery Algorithms)