russellmendonca / mier_public
☆13 · Updated last year
Related projects:
- ☆23 · Updated last year
- My Body Is A Cage ☆37 · Updated 3 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7 ☆20 · Updated 5 years ago
- ☆29 · Updated 3 years ago
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models ☆28 · Updated 3 years ago
- Latent Dynamics Mixture, NeurIPS 2021 ☆17 · Updated last year
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code ☆15 · Updated 3 years ago
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL) ☆24 · Updated 3 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆31 · Updated 3 years ago
- Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning" ☆17 · Updated last year
- ☆53 · Updated 6 months ago
- ☆12 · Updated 2 years ago
- Code for paper "Hierarchically Decoupled Imitation for Morphological Transfer" ☆17 · Updated last year
- Learning from Trajectories via Subgoal Discovery ☆13 · Updated 3 years ago
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning ☆63 · Updated 4 years ago
- ☆17 · Updated 2 years ago
- ☆13 · Updated 5 months ago
- ☆41 · Updated 5 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021] ☆45 · Updated last year
- Implementation of Data Efficient Reinforcement Learning in Pytorch ☆20 · Updated 5 years ago
- Code for FOCAL Paper Published at ICLR 2021 ☆49 · Updated 9 months ago
- ☆44 · Updated last year
- ☆41 · Updated 3 years ago
- ☆14 · Updated 3 years ago
- Image-based gridworld experiment for learning Markov state abstractions ☆18 · Updated this week
- Code for Paper "State Alignment-based Imitation Learning". Under maintenance ☆16 · Updated 4 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition" ☆18 · Updated 2 years ago
- Generalizable Imitation Learning from Observation via Inferring Goal Proximity (NeurIPS 2021) ☆22 · Updated 2 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆100 · Updated 2 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022) ☆64 · Updated 2 years ago