gkswamy98 / pillbox
Contains implementation of AdVIL, AdRIL, and DAeQuIL algorithms from the ICML '21 Paper Of Moments and Matching.
☆21Updated 2 years ago
Related projects: ⓘ
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆32Updated 2 years ago
- Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re…☆96Updated 2 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆76Updated last year
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆39Updated last year
- Reinforcement Learning with Latent Flow☆42Updated 3 years ago
- ☆34Updated last year
- Vectorization techniques for fast population-based training.☆52Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆30Updated 4 years ago
- ☆27Updated last year
- On the model-based stochastic value gradient for continuous reinforcement learning☆54Updated last year
- Sandbox environment for generalizable agent research☆22Updated 2 years ago
- My Body Is A Cage☆37Updated 3 years ago
- Invariant Causal Prediction for Block MDPs☆43Updated 4 years ago
- ☆33Updated last year
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)☆39Updated 3 years ago
- Generalised UDRL☆37Updated 2 years ago
- ☆41Updated 5 months ago
- Proto-RL: Reinforcement Learning with Prototypical Representations☆81Updated 2 years ago
- Official codebase for Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings.☆21Updated 3 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 3 months ago
- ☆27Updated 3 years ago
- ☆85Updated last month
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models☆28Updated 3 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆49Updated 2 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- ExORL: Exploratory Data for Offline Reinforcement Learning☆100Updated 2 years ago
- Reinforcement Learning via Supervised Learning☆67Updated 2 years ago
- EARL: Environment for Autonomous Reinforcement Learning☆33Updated last year