jlin816 / rewards-from-language
Code and data for "Inferring Rewards from Language in Context" [ACL 2022].
☆15Updated 2 years ago
Alternatives and similar repositories for rewards-from-language:
Users that are interested in rewards-from-language are comparing it to the libraries listed below
- ☆20Updated 3 years ago
- Implements the Messenger environment and EMMA model.☆23Updated last year
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity☆43Updated last year
- ☆12Updated 3 years ago
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆36Updated 2 years ago
- ☆24Updated 2 years ago
- Official code for the ACL 2021 Findings paper "Yichi Zhang and Joyce Chai. Hierarchical Task Learning from Language Instructions with Uni…☆24Updated 3 years ago
- ☆82Updated 8 months ago
- ☆33Updated last month
- ☆31Updated last year
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games☆14Updated 3 years ago
- Official code repository of the paper Learning Associative Inference Using Fast Weight Memory by Schlag et al.☆28Updated 4 years ago
- Code for EmBERT, a transformer model for embodied, language-guided visual task completion.☆57Updated last year
- ☆13Updated 2 years ago
- ☆37Updated 9 months ago
- Repository for DialFRED.☆42Updated last year
- The multi-modal sequence to sequence baseline neural models used in the Grounded SCAN paper.☆16Updated 4 years ago
- ☆22Updated 3 years ago
- Self-Supervised Alignment with Mutual Information☆17Updated 11 months ago
- Official code for our EMNLP2021 Outstanding Paper MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks☆22Updated last year
- Solving reinforcement learning tasks which require language and vision☆32Updated 2 years ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆28Updated last year
- A visual semantic planner for the ALFRED virtual agent challenge using the GPT-2 language model☆14Updated 4 years ago
- Dataset collection and training code for "Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning"☆9Updated 2 weeks ago
- This repo contains all the codes for SEScore implementation☆14Updated last month
- ☆52Updated last year
- Grounded SCAN data set.☆69Updated 3 years ago
- ReaSCAN is a synthetic navigation task that requires models to reason about surroundings over syntactically difficult languages. (NeurIPS…☆20Updated 3 years ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 7 months ago
- Implementation of the Playground environment from the paper Language as a Cognitive Tool to Imagine Goals inCuriosity-Driven Exploration.☆10Updated 4 years ago