gabegrand / world-models
☆198Updated last year
Related projects ⓘ
Alternatives and complementary repositories for world-models
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.☆214Updated 3 weeks ago
- An extensible benchmark for evaluating large language models on planning☆289Updated 5 months ago
- ☆73Updated 4 months ago
- Reasoning with Language Model is Planning with World Model☆144Updated last year
- ☆122Updated last week
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆98Updated 4 months ago
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆103Updated 7 months ago
- Emergent world representations: Exploring a sequence model trained on a synthetic task☆168Updated last year
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback☆92Updated last year
- Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".☆144Updated 10 months ago
- Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).☆195Updated last week
- ☆158Updated last year
- 🔗 LINC: Logical Inference via Neurosymbolic Computation [EMNLP2023]☆55Updated 10 months ago
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"☆197Updated last year
- ☆125Updated 9 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆160Updated last month
- Official Repo of LangSuitE☆78Updated 2 months ago
- The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset☆154Updated 6 months ago
- Code for STaR: Bootstrapping Reasoning With Reasoning (NeurIPS 2022)☆160Updated last year
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆219Updated 2 months ago
- Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)☆163Updated this week
- [NeurIPS 2023] Learning Transformer Programs☆157Updated 5 months ago
- Can Language Models Solve Olympiad Programming?☆100Updated 3 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆95Updated 2 months ago
- A repository for transformer critique learning and generation☆85Updated 11 months ago
- Super fast implementations of common benchmark text world games☆43Updated this week
- ☆79Updated last year
- Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"☆86Updated last year
- Self-Alignment with Principle-Following Reward Models☆148Updated 8 months ago
- The source code of the paper "Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Pla…☆77Updated 3 months ago