ChristosKap / policy_consolidation
Code for Policy Consolidation for Continual Reinforcement Learning
☆10Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for policy_consolidation
- Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline"☆32Updated last year
- ☆24Updated last year
- ☆20Updated 2 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆46Updated last year
- Code for the paper "Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation"☆17Updated last year
- ☆41Updated 6 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Updated 5 years ago
- This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.☆19Updated last year
- Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization☆32Updated 4 years ago
- Generalizable Imitation Learning from Observation via Inferring Goal Proximity (NeurIPS 2021)☆22Updated 3 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Updated 5 years ago
- ☆85Updated 10 months ago
- ☆54Updated 8 months ago
- Change-Based Exploration Transfer☆36Updated 2 years ago
- Domain-Robust Visual Imitation Learning with Mutual Information Constraints code☆16Updated 3 years ago
- ☆15Updated 3 years ago
- ☆29Updated 5 years ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1☆15Updated 4 years ago
- ☆47Updated last year
- ☆13Updated 7 months ago
- ☆36Updated last year
- Authors' PyTorch implementation of 'Recomposing the Reinforcement Learning Building-Blocks with Hypernetworks' (HypeRL)☆24Updated 3 years ago
- ☆41Updated 3 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆21Updated last year
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Updated 5 years ago
- ☆38Updated 3 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆15Updated 2 years ago
- Code for the paper "Learning Options via Compression" at NeurIPS 2022☆22Updated last year
- Implementation of the Prioritized Option-Critic on the Four-Rooms Environment☆15Updated 6 years ago
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning☆63Updated 4 years ago