gabegrand / world-models
☆198Updated last year
Related projects ⓘ
Alternatives and complementary repositories for world-models
- An extensible benchmark for evaluating large language models on planning☆291Updated 6 months ago
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.☆220Updated last month
- ☆122Updated 2 weeks ago
- 🔗 LINC: Logical Inference via Neurosymbolic Computation [EMNLP2023]☆55Updated 10 months ago
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆101Updated this week
- Reasoning with Language Model is Planning with World Model☆145Updated last year
- ☆73Updated 4 months ago
- ☆105Updated 4 months ago
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆95Updated last year
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"☆199Updated last year
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆80Updated last week
- A repository for transformer critique learning and generation☆86Updated 11 months ago
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆164Updated this week
- Emergent world representations: Exploring a sequence model trained on a synthetic task☆169Updated last year
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha…☆104Updated 5 months ago
- SwiftSage: A Generative Agent with Fast and Slow Thinking for Complex Interactive Tasks☆279Updated 3 weeks ago
- Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)☆164Updated last year
- Self-Alignment with Principle-Following Reward Models☆148Updated 8 months ago
- The source code of the paper "Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Pla…☆77Updated 3 months ago
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆86Updated last year
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆220Updated 2 months ago
- Inspecting and Editing Knowledge Representations in Language Models☆108Updated last year
- (ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training☆219Updated 5 months ago
- Official Repo of LangSuitE☆78Updated 3 months ago
- ☆137Updated 6 months ago
- Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".☆155Updated 6 months ago
- ☆158Updated last year
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆105Updated 7 months ago
- Can Language Models Solve Olympiad Programming?☆100Updated 3 months ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).