cognitiveailab / TextWorldExpress
Super fast implementations of common benchmark text world games
☆46Updated last week
Alternatives and similar repositories for TextWorldExpress:
Users that are interested in TextWorldExpress are comparing it to the libraries listed below
- ☆36Updated 8 months ago
- ☆20Updated 3 years ago
- [EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games☆69Updated 4 years ago
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆53Updated 9 months ago
- ☆32Updated 3 years ago
- ☆82Updated 8 months ago
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL☆48Updated 5 months ago
- A reinforcement learning environment for the IGLU 2022 at NeurIPS☆33Updated last year
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆106Updated 4 months ago
- Knowledge-Aware RL agents with Commonsense Reasoning☆77Updated 3 years ago
- ☆23Updated 6 months ago
- ☆34Updated 11 months ago
- Gantry streamlines running Python experiments in Beaker by managing containers and boilerplate for you☆22Updated 3 weeks ago
- Repository for Skill Set Optimization☆12Updated 7 months ago
- Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023…☆29Updated last year
- ☆50Updated last year
- Code and data for "Inferring Rewards from Language in Context" [ACL 2022].☆15Updated 2 years ago
- ☆29Updated last year
- ☆12Updated 3 years ago
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"☆53Updated last year
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆15Updated last month
- Evaluation on Logical Reasoning and Abstract Reasoning Challenges☆25Updated last year
- ☆23Updated 6 months ago
- ☆16Updated 3 years ago
- ☆33Updated last year
- Template-DQN and DRRN agent implementations☆22Updated last year
- A Toolkit for Distributional Control of Generative Models☆72Updated last year
- ☆13Updated 3 years ago
- Byte-sized text games for code generation tasks on virtual environments☆19Updated 8 months ago
- Mechanistic Interpretability for Transformer Models☆50Updated 2 years ago