nicklashansen / policy-adaptation-during-deployment
Training code and evaluation benchmarks for the "Self-Supervised Policy Adaptation during Deployment" paper.
☆111Updated 3 years ago
Related projects: ⓘ
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆101Updated last year
- ☆41Updated 3 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆142Updated 3 years ago
- rllab's viskit with some added features☆73Updated last year
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm)☆158Updated 2 years ago
- [ICML 2020] PyTorch Code for "One Policy to Control Them All: Shared Modular Policies for Agent-Agnostic Control"☆215Updated last year
- DMControl Generalization Benchmark☆166Updated 8 months ago
- State Representation Learning (SRL) zoo with PyTorch - Part of S-RL Toolbox☆161Updated 5 years ago
- ☆107Updated last year
- Proto-RL: Reinforcement Learning with Prototypical Representations☆81Updated 2 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆76Updated last year
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆157Updated 2 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆200Updated 4 months ago
- The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020)☆75Updated 9 months ago
- Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.☆90Updated last year
- Code for "Learning to Reach Goals via Iterated Supervised Learning"☆76Updated 2 years ago
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning☆63Updated 4 years ago
- Revisiting Rainbow☆73Updated 3 years ago
- Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re…☆96Updated 2 years ago
- ☆52Updated 4 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆149Updated 3 years ago
- Change-Based Exploration Transfer☆35Updated 2 years ago
- [ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement☆123Updated 5 years ago
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)☆40Updated last year
- cordial-sync is a software package than can be used to reproduce the results from the paper "A Cordial Sync: Going Beyond Marginal Polici…☆37Updated 3 years ago
- ☆69Updated 2 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆129Updated last year
- impact-driven-exploration☆125Updated 11 months ago
- accompanying code for neurips submission "Goal-conditioned Imitation Learning"☆67Updated last year
- A list of papers regarding generalization in (deep) reinforcement learning☆141Updated last year