tsumers / rewards
Code and data for Learning Rewards from Linguistic Feedback, AAAI '21
☆10 · Updated 4 years ago
Alternatives and similar repositories for rewards:
Users who are interested in rewards are comparing it to the libraries listed below:
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction. ☆19 · Updated 3 years ago
- Implements the Messenger environment and EMMA model. ☆23 · Updated last year
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems" ☆54 · Updated 7 months ago
- Change-Based Exploration Transfer ☆36 · Updated 2 years ago
- Intrinsic Reward Matching (IRM) implementation (from Adeniji and Xie et al 2022) ☆43 · Updated last year
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction" ☆44 · Updated last year
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment… ☆18 · Updated 2 years ago
- Generalised UDRL ☆37 · Updated 2 years ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021 ☆85 · Updated last year
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games ☆14 · Updated 2 years ago
- Simplistic PyTorch Implementation of the Dreamer-RL ☆21 · Updated 2 years ago
- My Body Is A Cage ☆39 · Updated 3 years ago
- Bipedal Skills Benchmark for Reinforcement Learning ☆26 · Updated 2 years ago
- EARL: Environment for Autonomous Reinforcement Learning ☆36 · Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 · Updated 5 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 · Updated 9 months ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023) ☆23 · Updated last year
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML… ☆25 · Updated 2 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021) ☆52 · Updated 3 years ago
- Mutual Information State Intrinsic Control (ICLR 2021 Spotlight) ☆37 · Updated 3 years ago
- Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" (ICLR 2022) ☆14 · Updated 2 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors" ☆44 · Updated 2 years ago
- Lipschitz-constrained Unsupervised Skill Discovery (ICLR 2022) ☆34 · Updated last year
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling ☆15 · Updated 3 years ago
- Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023) ☆32 · Updated 2 years ago