TianhongDai / metaworld-sac
☆10 Updated 4 years ago
Related projects
Alternatives and complementary repositories for metaworld-sac
- Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019) ☆23 Updated 5 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆31 Updated 3 years ago
- My Body Is A Cage ☆38 Updated 3 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation ☆25 Updated last year
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning ☆63 Updated 4 years ago
- Code for Sibling Rivalry and experiments presented in associated paper ☆16 Updated 3 years ago
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models ☆28 Updated 3 years ago
- Code and project page for D-REX algorithm from the paper "Better-than-Demonstrator Imitation Learning via Automatically-Ranked Demonstrat… ☆49 Updated last year
- ☆41 Updated 6 years ago
- ☆44 Updated 2 years ago
- MELD: Meta-Reinforcement Learning from Images via Latent State Models https://arxiv.org/abs/2010.13957 ☆63 Updated 3 years ago
- Mutual Information State Intrinsic Control (ICLR 2021 Spotlight) ☆37 Updated 3 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery ☆78 Updated 2 years ago
- ☆41 Updated 3 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE ☆53 Updated 5 years ago
- ☆45 Updated last year
- Implementation of Jump-Start Reinforcement Learning (JSRL) with Stable Baselines3 ☆24 Updated 10 months ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables ☆69 Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆31 Updated 4 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch ☆49 Updated 6 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information. ☆36 Updated 3 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist] ☆34 Updated 2 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆98 Updated 2 years ago
- Curriculum-guided Hindsight Experience Replay (NeurIPS-2019) ☆61 Updated 4 years ago
- AGAC: Adversarially Guided Actor-Critic ☆47 Updated 3 years ago
- Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL" ☆21 Updated 3 years ago
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020) ☆43 Updated last year
- Source files to replicate experiments in my ICLR 2022 paper. ☆61 Updated 4 months ago
- ☆40 Updated 3 years ago
- Single Episode Policy Transfer in Reinforcement Learning ☆17 Updated 2 years ago