Mohammadamin-Barekatain / multipolar
Multi-Source Policy Aggregation for Transfer Reinforcement Learning between Diverse Environmental Dynamics
☆9 Updated 4 years ago
Related projects
Alternatives and complementary repositories for multipolar
- Code and project page for the D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat… ☆49 Updated last year
- PyTorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019) ☆18 Updated 2 years ago
- Unofficial re-implementation of "Dream to Control: Learning Behaviors by Latent Imagination" (https://arxiv.org/abs/1912.01603) with PyT… ☆27 Updated 4 years ago
- Codebase of Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization (ICLR 2021) ☆51 Updated 3 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP ☆28 Updated 3 years ago
- Unofficial re-implementation of "Learning Latent Dynamics for Planning from Pixels" (https://arxiv.org/abs/1811.04551) with PyTorch ☆43 Updated 4 years ago
- A library for building reinforcement learning and imitation learning agents in PyTorch ☆58 Updated 4 years ago
- Advantage weighted Actor Critic for Offline RL ☆47 Updated 2 years ago
- NeurIPS Reproducibility Challenge 2019 ☆20 Updated 4 years ago
- Code accompanying HAAR paper, NeurIPS 2019 - Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards ☆30 Updated last year
- A standalone library to randomize various OpenAI Gym Environments ☆60 Updated 5 years ago
- Energy-Based Hindsight Experience Prioritization (CoRL 2018) Oral presentation (7%) ☆33 Updated 5 years ago
- Safe Model-based Reinforcement Learning with Robust Cross-Entropy Method ☆62 Updated last year
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238 ☆44 Updated 4 years ago
- Code for "Divide-and-Conquer Reinforcement Learning" ☆60 Updated 5 years ago
- Accompanying code for NeurIPS submission "Goal-conditioned Imitation Learning" ☆67 Updated last year
- Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020] ☆48 Updated 3 years ago
- Library for model-based RL in robotics ☆37 Updated 6 years ago
- Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play ☆14 Updated 6 years ago
- Residual policy learning ☆58 Updated 5 years ago
- Implementation of ICML 2020 paper "Bidirectional Model-based Policy Optimization" ☆23 Updated last year
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining" ☆28 Updated 5 years ago
- Author's PyTorch implementation of LAP and PAL with TD3 and DDQN ☆34 Updated 2 years ago
- Proximal Policy Option-Critic ☆21 Updated 5 years ago