cognitiveailab / TextWorldExpress
Super fast implementations of common benchmark text world games
☆45Updated 2 months ago
Alternatives and similar repositories for TextWorldExpress:
Users that are interested in TextWorldExpress are comparing it to the libraries listed below
- ☆79Updated 7 months ago
- ☆32Updated 3 years ago
- ☆20Updated 3 years ago
- ☆35Updated 7 months ago
- [EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games☆69Updated 3 years ago
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆52Updated 8 months ago
- Exploring Few-Shot Adaptation of Language Models with Tables☆23Updated 2 years ago
- ☆27Updated last year
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆14Updated last week
- Knowledge-Aware RL agents with Commonsense Reasoning☆75Updated 2 years ago
- ☆38Updated 5 months ago
- ☆16Updated 3 years ago
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆106Updated 3 months ago
- A reinforcement learning environment for the IGLU 2022 at NeurIPS☆33Updated last year
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games☆14Updated 2 years ago
- Implements the Messenger environment and EMMA model.☆23Updated last year
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Updated 2 years ago
- ☆12Updated 3 years ago
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL☆48Updated 4 months ago
- ☆23Updated 5 months ago
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"☆53Updated last year
- A Toolkit for Distributional Control of Generative Models☆70Updated last year
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity☆40Updated last year
- Byte-sized text games for code generation tasks on virtual environments☆19Updated 7 months ago
- The InterScript dataset contains interactive user feedback on scripts generated by a T5-XXL model.☆11Updated 3 years ago
- ☆34Updated 10 months ago
- ☆30Updated last year
- Gantry streamlines running Python experiments in Beaker by managing containers and boilerplate for you☆22Updated 3 weeks ago
- Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023…☆29Updated last year
- ☆67Updated 2 years ago