An extensible benchmark for evaluating large language models on planning
☆451 Sep 17, 2025 Updated 5 months ago
Alternatives and similar repositories for LLMs-Planning
Users who are interested in LLMs-Planning are comparing it to the libraries listed below.
- The source code of the paper "Leveraging Pre-trained Large Language Models to Construct and Utilize World Models for Model-based Task Pla… ☆107 Aug 11, 2024 Updated last year
- ☆67 Jan 22, 2024 Updated 2 years ago
- ☆451 Sep 27, 2023 Updated 2 years ago
- The Fast Downward domain-independent classical planning system ☆377 Feb 15, 2026 Updated 2 weeks ago
- A collection of PDDL generators, some of which have been used to generate benchmarks for the International Planning Competition (IPC). ☆149 Jan 3, 2026 Updated 2 months ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24) ☆3,187 Feb 8, 2026 Updated 3 weeks ago
- A library for advanced large language model reasoning ☆2,336 Jun 10, 2025 Updated 8 months ago
- ☆22 Oct 16, 2025 Updated 4 months ago
- Large language models for PDDL domains ☆44 May 15, 2023 Updated 2 years ago
- Must-read Papers on Large Language Model (LLM) Planning. ☆435 Jul 4, 2024 Updated last year
- Reasoning with Language Model is Planning with World Model ☆186 Aug 25, 2023 Updated 2 years ago
- This repository contains a collection of papers and resources on Reasoning in Large Language Models. ☆567 Nov 13, 2023 Updated 2 years ago
- Official Code for "Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents" ☆278 May 16, 2022 Updated 3 years ago
- ALFWorld: Aligning Text and Embodied Environments for Interactive Learning ☆656 Feb 8, 2026 Updated 3 weeks ago
- ☆31 Jun 12, 2024 Updated last year
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback ☆125 Mar 31, 2025 Updated 11 months ago
- Translating HTN planning problems to PDDL ☆21 Jul 7, 2021 Updated 4 years ago
- ALFRED - A Benchmark for Interpreting Grounded Instructions for Everyday Tasks ☆491 Feb 5, 2026 Updated last month
- Qualitative Numeric Planning ☆10 Dec 10, 2020 Updated 5 years ago
- ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum. ☆337 Dec 3, 2025 Updated 3 months ago
- ☆133 Jul 10, 2024 Updated last year
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks ☆20 May 10, 2022 Updated 3 years ago
- Implementation of "Describe, Explain, Plan and Select: Interactive Planning with Large Language Models Enables Open-World Multi-Task Agen… ☆290 Aug 3, 2023 Updated 2 years ago
- (ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training ☆285 May 26, 2024 Updated last year
- List of language agents based on paper "Cognitive Architectures for Language Agents" ☆1,180 Jan 16, 2025 Updated last year
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett… ☆298 Nov 16, 2024 Updated last year
- Learning for effective and efficient bilevel planning ☆137 Feb 11, 2026 Updated 3 weeks ago
- ☆21 Dec 19, 2025 Updated 2 months ago
- LLM Dynamic Planner - Combining LLM with PDDL Planners to solve an embodied task ☆48 Jan 4, 2025 Updated last year
- [ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models ☆217 Mar 26, 2025 Updated 11 months ago
- ☆2,883 Feb 20, 2025 Updated last year
- Synthetic question-answering dataset to formally analyze the chain-of-thought output of large language models on a reasoning task. ☆155 Sep 9, 2025 Updated 5 months ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text ☆276 Oct 27, 2025 Updated 4 months ago
- ☆917 Jul 24, 2024 Updated last year
- 🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023) ☆24 Oct 10, 2023 Updated 2 years ago
- [ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents" ☆482 Nov 7, 2025 Updated 3 months ago
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024) ☆37 Dec 29, 2024 Updated last year
- Convert a PDDL domain into an OpenAI Gym environment. ☆263 Jul 22, 2025 Updated 7 months ago
- 🌍 PDDL instances covering the International Planning Competitions ☆147 Mar 11, 2021 Updated 4 years ago