chandar-lab / Lifelong-HanabiLinks
A Continual Multi-agent RL testbed based on Hanabi
☆32Updated 4 years ago
Alternatives and similar repositories for Lifelong-Hanabi
Users that are interested in Lifelong-Hanabi are comparing it to the libraries listed below
Sorting:
- Code for our NeurIPS 2020 paper Improving Generalization in Reinforcement Learning with Mixture Regularization☆34Updated 5 years ago
- ☆15Updated 6 years ago
- Taming MAML: efficient unbiased meta-reinforcement learning☆30Updated 3 years ago
- ☆17Updated 5 years ago
- Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning / ICLR 2020☆55Updated 5 years ago
- [ICLR 2020, Oral] Harnessing Structures for Value-Based Planning and Reinforcement Learning☆34Updated 5 years ago
- Continual Reinforcement Learning in 3D Non-stationary Environments☆39Updated 6 years ago
- Code for the paper Novelty Search in Representational Space for Sample Efficient Exploration presented at NeurIPS 2020.☆14Updated last year
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Updated 5 years ago
- This repository contains implementations of the paper, Bayesian Model-Agnostic Meta-Learning.☆20Updated 2 years ago
- Invariant Causal Prediction for Block MDPs☆44Updated 5 years ago
- Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning☆52Updated 4 years ago
- ☆19Updated 5 years ago
- ☆78Updated 4 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline"☆34Updated 2 years ago
- Variational Reinforcement Learning☆16Updated last year
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information.☆36Updated 5 years ago
- ☆11Updated 3 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"☆45Updated 3 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆103Updated 2 years ago
- Code Release for Task Agnostic Dynamics Priors for Deep Reinforcement Learning☆12Updated 6 years ago
- [CVPR 2021] Official Implementation of VAI: Unsupervised Visual Attention and Invariance for Reinforcement Learning☆27Updated 3 years ago
- Source code for the Self-Paced Deep Reinforcement Learning Experiments☆32Updated 2 years ago
- Code to reproduce results on toy tasks and companion blog for the paper.☆21Updated 3 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Updated 2 years ago
- Implementation of Random Expert Distillation☆29Updated 6 years ago
- ☆43Updated 7 years ago
- ☆54Updated 6 years ago
- ICLR 2020 Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies☆18Updated 5 years ago