HumanCompatibleAI / rlsp
Reward Learning by Simulating the Past
☆44Updated 5 years ago
Alternatives and similar repositories for rlsp:
Users that are interested in rlsp are comparing it to the libraries listed below
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation"☆43Updated 3 years ago
- ☆44Updated 6 years ago
- ☆80Updated last year
- Generalised UDRL☆37Updated 2 years ago
- Inferring beliefs about dynamics from behavior☆29Updated 6 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters.☆23Updated 4 years ago
- Variational Reinforcement Learning☆16Updated 7 months ago
- Official implementation of DynE, Dynamics-aware Embeddings for RL☆43Updated 3 years ago
- On the pitfalls of measuring emergent communication☆34Updated 6 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- ☆25Updated 6 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago
- A collection of reading material for the Workshop on "Structure & Priors in Reinforcement Learning" (SPiRL) at ICLR 2019.☆13Updated 3 years ago
- Efficient Exploration via State Marginal Matching (2019)☆67Updated 5 years ago
- Official code for the paper "Learning Transition Policies for Composing Complex Skills" (ICLR 2019)☆74Updated 5 years ago
- Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)☆44Updated last year
- Solving reinforcement learning tasks which require language and vision☆32Updated last year
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆78Updated 5 years ago
- Code accompanying the OptionGAN paper.☆43Updated 6 years ago
- Public Release of Plan2vec Implementation in pyTorch☆56Updated 2 years ago
- ☆20Updated 5 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆16Updated 5 years ago
- ☆13Updated 6 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆102Updated last year
- E2C implementation in PyTorch☆43Updated 7 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- [ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement☆123Updated 5 years ago
- CLEVR-Robot: a reinforcement learning environment combining vision, language and control.☆133Updated 7 months ago
- mplementation of Advantage Actor Critic (A2C) and Proximal Policy Optimization Algorithm (PPO) use the advantages of Tensorflow 2.x.☆9Updated 4 years ago