tsumers / rewardsLinks
Code and data for Learning Rewards from Linguistic Feedback, AAAI '21
☆10Updated 4 years ago
Alternatives and similar repositories for rewards
Users that are interested in rewards are comparing it to the libraries listed below
Sorting:
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Updated 4 years ago
- Implements the Messenger environment and EMMA model.☆23Updated last year
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- Change-Based Exploration Transfer☆36Updated 3 years ago
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games☆14Updated 3 years ago
- Code for the paper "Learning to Assist Humans without Inferring Rewards"☆15Updated 11 months ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- EARL: Environment for Autonomous Reinforcement Learning☆37Updated 2 years ago
- ☆36Updated 2 years ago
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…☆19Updated 3 years ago
- Official codebase for LEAP: Planning with Goal Conditioned Policies☆50Updated 2 years ago
- Codebase for project about unsupervised skill learning via variational inference and causality.☆42Updated last year
- Exploring techniques to generate diverse conventions in multi-agent settings☆14Updated last year
- A reinforcement learning environment for the IGLU 2022 at NeurIPS☆33Updated 2 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆55Updated 11 months ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"☆44Updated 2 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆105Updated 2 years ago
- Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" (ICLR 2022)☆14Updated 3 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Updated 2 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆18Updated 2 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated last year
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆55Updated 3 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆39Updated 2 years ago
- Model-Based Reinforcement Learning via Latent-Space Collocation.☆33Updated 2 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆17Updated 3 years ago
- using information theory to encourage agents to cooperate and compete☆19Updated 6 years ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- ☆42Updated 3 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆28Updated 5 years ago
- My Body Is A Cage☆41Updated 4 years ago