matthewsparr / Deep-Zork
Using NLP and reinforcement learning to build an AI capable of playing text-based games
☆25Updated 5 years ago
Alternatives and similar repositories for Deep-Zork:
Users that are interested in Deep-Zork are comparing it to the libraries listed below
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆44Updated 5 months ago
- [EMNLP 2020] Keep CALM and Explore: Language Models for Action Generation in Text-based Games☆69Updated 4 years ago
- An experiment to see if chatgpt can improve the output of the stanford alpaca dataset☆12Updated last year
- ☆25Updated 9 months ago
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated last year
- Based on the tree of thoughts paper☆46Updated last year
- A Data Source for Reasoning Embodied Agents☆19Updated last year
- 🧪Create domain-adapted language models by distilling from many pre-trained LMs☆10Updated 2 years ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated 9 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆15Updated last year
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task☆41Updated 2 months ago
- Exploring limitations of LLM-as-a-judge☆15Updated 7 months ago
- The Next Generation Multi-Modality Superintelligence☆71Updated 6 months ago
- Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations…☆28Updated last year
- ☆18Updated last month
- ☆27Updated this week
- Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instance☆27Updated 2 years ago
- Repo to reproduce the First-Explore paper results☆37Updated 3 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆63Updated last year
- A dataset of alignment research and code to reproduce it☆74Updated last year
- Experimental sampler to make LLMs more creative☆30Updated last year
- The official implementation of the paper "Read to Play (R2-Play): Decision Transformer with Multimodal Game Instruction".☆34Updated last year
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆66Updated last year
- ☆14Updated last year
- Source codes for the paper "Bounding the Capabilities of Large Language Models in Open Text Generation with Prompt Constraints"☆28Updated 2 years ago
- A Python reimplementation of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)☆17Updated last year
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆91Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- A testbed for agents and environments that can automatically improve models through data generation.☆21Updated 3 weeks ago
- ☆12Updated this week