stanfordnlp / wge
Workflow-Guided Exploration: sample-efficient RL agent for web tasks
☆111Updated last year
Alternatives and similar repositories for wge:
Users that are interested in wge are comparing it to the libraries listed below
- Graph-based Deep Q Network for Web Navigation☆47Updated 5 years ago
- MiniWoB++: a web interaction benchmark for reinforcement learning☆303Updated 11 months ago
- Demos for the MiniWoB++ benchmark☆19Updated 6 years ago
- Template-DQN and DRRN agent implementations☆22Updated last year
- Neural Programmer-Interpreter Implementation (Reed, de Freitas: https://arxiv.org/abs/1511.06279), in Tensorflow☆41Updated 6 years ago
- ☆82Updated 5 years ago
- ☆16Updated 3 years ago
- Super fast implementations of common benchmark text world games☆45Updated 2 months ago
- Mapping natural language commands to web elements☆37Updated 2 years ago
- We release dataset collected for our research, code that implement neural network models described in the paper, and scripts to reproduce…☆161Updated 3 years ago
- Experiment code for the ICLR 2020 paper "RTFM: Generalising to New Environment Dynamics via Reading".☆38Updated 3 years ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆214Updated 3 months ago
- impact-driven-exploration☆130Updated last year
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"☆203Updated last year
- Awesome RL: Papers, Books, Codes, Benchmarks☆115Updated last year
- I am implementing a lot of reinforcement learning and imitation learning algorithms since I'm sick of reading about them but not really u…☆51Updated 5 years ago
- Code for Emergent Translation in Multi-Agent Communication☆80Updated 6 years ago
- CLEVR-Robot: a reinforcement learning environment combining vision, language and control.☆130Updated 6 months ago
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆201Updated 4 years ago
- Adapting the AlphaZero algorithm to remove the need of execution traces to train NPI.☆79Updated last year
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆102Updated 2 years ago
- On the pitfalls of measuring emergent communication☆34Updated 5 years ago
- A collection of baselines for the MineRL environment/datasets & the NeurIPS 2021 MineRL competitions☆147Updated 3 years ago
- Automatically Composing Representation Transformations as a Means for Generalization☆24Updated 5 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆197Updated 6 years ago
- Reproducing the reinforcement learning models used in "Emergence of Linguistic Communication from Referential Games with Symbolic and Pix…☆12Updated 6 years ago
- An implementation of Emergence of Grounded Compositional Language in Multi-Agent Populations by Igor Mordatch and Pieter Abbeel☆72Updated 7 years ago
- Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)☆236Updated 6 years ago
- Code for the paper, "Learning Human Objectives by Evaluating Hypothetical Behavior"☆83Updated 5 years ago
- ☆43Updated 5 years ago