tsumers / rewardsLinks
Code and data for Learning Rewards from Linguistic Feedback, AAAI '21
☆10Updated 4 years ago
Alternatives and similar repositories for rewards
Users that are interested in rewards are comparing it to the libraries listed below
Sorting:
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Updated 4 years ago
- Implements the Messenger environment and EMMA model.☆23Updated 2 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated last year
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆44Updated last year
- EARL: Environment for Autonomous Reinforcement Learning☆37Updated 2 years ago
- Change-Based Exploration Transfer☆35Updated 3 years ago
- Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML…☆26Updated 2 years ago
- Code for Model-Free Opponent Shaping (ICML 2022)☆18Updated 2 years ago
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games☆14Updated 3 years ago
- ☆42Updated 3 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆21Updated last month
- This repository is a collection of widely used self-supervised auxiliary losses used for learning representations in reinforcement learni…☆14Updated 2 years ago
- Learning Off-Policy with Online Planning [CoRL 2021 Best Paper Finalist]☆39Updated 2 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆55Updated 11 months ago
- Code for "Possibility Before Utility: Learning And Using Hierarchical Affordances" (ICLR 2022)☆14Updated 3 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations☆82Updated 3 years ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021☆85Updated last year
- Sandbox environment for generalizable agent research☆25Updated 2 years ago
- Scalable Opponent Shaping Experiments in JAX☆24Updated last year
- Codebase for project about unsupervised skill learning via variational inference and causality.☆42Updated last year
- ☆40Updated 3 years ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆18Updated 7 months ago
- Code for the paper "Learning to Assist Humans without Inferring Rewards"☆15Updated 11 months ago
- [CoRL 2020] COG: Connecting New Skills to Past Experience with Offline Reinforcement Learning☆33Updated 4 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆54Updated 3 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"☆44Updated 2 years ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆68Updated 3 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…☆19Updated 3 years ago
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆17Updated 2 years ago