gabegrand / world-models
☆205Updated last year
Alternatives and similar repositories for world-models:
Users that are interested in world-models are comparing it to the libraries listed below
- Repository for the paper Stream of Search: Learning to Search in Language☆145Updated 3 months ago
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.☆258Updated 6 months ago
- ☆92Updated 10 months ago
- Reasoning with Language Model is Planning with World Model☆164Updated last year
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆106Updated 5 months ago
- ☆159Updated 2 years ago
- Inspecting and Editing Knowledge Representations in Language Models☆116Updated last year
- ☆132Updated 6 months ago
- ☆115Updated 9 months ago
- An extensible benchmark for evaluating large language models on planning☆356Updated 2 weeks ago
- A repository for transformer critique learning and generation☆90Updated last year
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"☆207Updated last year
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL☆48Updated 6 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆155Updated 5 months ago
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆91Updated last year
- A domain-specific probabilistic programming language for modeling and inference with language models☆129Updated last week
- Composable inference algorithms with LLMs and programmable logic☆67Updated 5 months ago
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task.☆146Updated 6 months ago
- Simple next-token-prediction for RLHF☆225Updated last year
- ☆175Updated last year
- Emergent world representations: Exploring a sequence model trained on a synthetic task☆181Updated last year
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback☆206Updated last year
- Self-Alignment with Principle-Following Reward Models☆160Updated last year
- Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference…☆207Updated 3 months ago
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆212Updated last week
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆107Updated last month
- About The corresponding code from our paper " REFINER: Reasoning Feedback on Intermediate Representations" (EACL 2024). Do not hesitate t…☆70Updated last year
- The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset☆157Updated last year
- 🔗 LINC: Logical Inference via Neurosymbolic Computation [EMNLP2023]☆67Updated last year
- ☆114Updated 9 months ago