tsumers / rewards
Code and data for Learning Rewards from Linguistic Feedback, AAAI '21
☆10 Updated 4 years ago
Alternatives and similar repositories for rewards
Users who are interested in rewards compare it to the repositories listed below.
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction" ☆44 Updated last year
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction. ☆21 Updated 4 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu ☆105 Updated 3 years ago
- Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022) ☆43 Updated last year
- Implements the Messenger environment and EMMA model. ☆25 Updated 2 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations ☆82 Updated 3 years ago
- Fast reinforcement learning research ☆61 Updated 8 months ago
- PushWorld: A benchmark for manipulation planning with tools and movable obstacles ☆83 Updated last year
- EARL: Environment for Autonomous Reinforcement Learning ☆37 Updated 2 years ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021 ☆85 Updated last year
- Simplistic Pytorch Implementation of the Dreamer-RL ☆20 Updated 3 months ago
- ☆54 Updated 3 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery ☆81 Updated 3 years ago
- ☆28 Updated 3 years ago
- A reinforcement learning environment for the IGLU 2022 at NeurIPS ☆34 Updated 2 years ago
- ☆54 Updated 9 months ago
- Sandbox environment for generalizable agent research ☆26 Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment ☆28 Updated 3 years ago
- Model-Based Reinforcement Learning via Latent-Space Collocation. ☆33 Updated 2 years ago
- Invariant Causal Prediction for Block MDPs ☆44 Updated 5 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021) ☆54 Updated 4 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 Updated last year
- Codebase for project about unsupervised skill learning via variational inference and causality. ☆43 Updated last year
- Learning Robust Dynamics Through Variational Sparse Gating ☆20 Updated 2 years ago
- Infer how suboptimal agents are suboptimal while planning, for example if they are hyperbolic time discounters. ☆25 Updated 4 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration ☆69 Updated 4 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository ☆22 Updated 4 years ago
- Change-Based Exploration Transfer ☆35 Updated 3 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools ☆18 Updated 2 years ago
- MELD: Meta-Reinforcement Learning from Images via Latent State Models https://arxiv.org/abs/2010.13957 ☆63 Updated 4 years ago