tsumers / rewards
Code and data for Learning Rewards from Linguistic Feedback, AAAI '21
☆10Updated 4 years ago
Alternatives and similar repositories for rewards:
Users that are interested in rewards are comparing it to the libraries listed below
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Updated 4 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games☆14Updated 3 years ago
- Implements the Messenger environment and EMMA model.☆23Updated last year
- Change-Based Exploration Transfer☆36Updated 3 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 11 months ago
- EARL: Environment for Autonomous Reinforcement Learning☆37Updated 2 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆18Updated 2 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Updated 2 years ago
- Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022)☆43Updated last year
- Exploring techniques to generate diverse conventions in multi-agent settings☆14Updated last year
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆54Updated 3 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆17Updated 6 months ago
- ☆41Updated 3 years ago
- Official codebase for LEAP: Planning with Goal Conditioned Policies☆50Updated 2 years ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆14Updated 2 years ago
- ☆36Updated 2 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆16Updated 3 years ago
- Codebase for project about unsupervised skill learning via variational inference and causality.☆42Updated last year
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…☆26Updated 2 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"☆23Updated last year
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- My Body Is A Cage☆40Updated 4 years ago
- Generalised UDRL☆37Updated 2 years ago
- Latent World Models For Intrinsically Motivated Exploration | Official repository☆22Updated 4 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆55Updated 10 months ago
- ☆53Updated 6 months ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆104Updated 2 years ago
- Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" (ICLR 2022)☆14Updated 3 years ago
- ☆23Updated 3 years ago