tsumers / rewards
Code and data for Learning Rewards from Linguistic Feedback, AAAI '21
☆10Updated 4 years ago
Alternatives and similar repositories for rewards:
Users that are interested in rewards are comparing it to the libraries listed below
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Updated 4 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)☆43Updated last year
- Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" (ICLR 2022)☆14Updated 2 years ago
- Change-Based Exploration Transfer☆36Updated 2 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 9 months ago
- Implements the Messenger environment and EMMA model.☆23Updated last year
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- Model-Based Reinforcement Learning via Latent-Space Collocation.☆32Updated last year
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆54Updated 8 months ago
- ☆40Updated 3 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Updated last year
- Code for Model-Free Opponent Shaping (ICML 2022)☆18Updated 2 years ago
- This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…☆29Updated 2 years ago
- ☆35Updated 2 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"☆44Updated 2 years ago
- ☆15Updated 3 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆36Updated 2 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository☆22Updated 3 years ago
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆15Updated 10 months ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- ☆22Updated 3 years ago
- using information theory to encourage agents to cooperate and compete☆19Updated 6 years ago
- EARL: Environment for Autonomous Reinforcement Learning☆36Updated 2 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Updated 2 years ago
- My Body Is A Cage☆39Updated 3 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30Updated 2 years ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021☆85Updated last year
- Sandbox environment for generalizable agent research☆24Updated 2 years ago
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models☆30Updated 3 years ago