[NeurIPS 2024] Agent Planning with World Knowledge Model
☆165 · Dec 17, 2024 · Updated last year
Alternatives and similar repositories for WKM
Users who are interested in WKM are comparing it to the repositories listed below.
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference) ☆160 · Oct 30, 2024 · Updated last year
- [NAACL 2025] KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents ☆257 · Jan 29, 2025 · Updated last year
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples ☆122 · Jan 31, 2026 · Updated last month
- MPO: Boosting LLM Agents with Meta Plan Optimization (EMNLP 2025 Findings) ☆76 · Aug 20, 2025 · Updated 7 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference) ☆66 · Oct 18, 2024 · Updated last year
- ControlLM is a method to control the personality traits and behaviors of language models in real-time at inference without costly trainin… ☆19 · Nov 6, 2024 · Updated last year
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction ☆15 · Nov 4, 2024 · Updated last year
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning ☆235 · Jan 13, 2025 · Updated last year
- [ICLR 2025] Benchmarking Agentic Workflow Generation ☆146 · Feb 19, 2025 · Updated last year
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann… ☆14 · Nov 3, 2023 · Updated 2 years ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024. ☆13 · Jun 17, 2024 · Updated last year
- Scorpius: Poisoning scientific knowledge using large language models ☆11 · Aug 3, 2024 · Updated last year
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models ☆27 · Jul 9, 2024 · Updated last year
- [EMNLP 2024] Ask-before-Plan: Proactive Language Agents for Real-World Planning ☆21 · Jul 28, 2025 · Updated 7 months ago
- [NeurIPS 2024 Oral] "Bayesian-Guided Label Mapping for Visual Reprogramming" ☆12 · Dec 20, 2024 · Updated last year
- An extensible benchmark for evaluating large language models on planning ☆455 · Sep 17, 2025 · Updated 6 months ago
- Official code release of AAAI 2024 paper SayCanPay. ☆54 · Oct 22, 2025 · Updated 5 months ago
- [ICML 2025] "From Debate to Equilibrium: Belief-Driven Multi-Agent LLM Reasoning via Bayesian Nash Equilibrium" ☆34 · Nov 23, 2025 · Updated 3 months ago
- ALFWorld: Aligning Text and Embodied Environments for Interactive Learning ☆679 · Feb 8, 2026 · Updated last month
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models ☆24 · Jan 6, 2026 · Updated 2 months ago
- [ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models ☆11 · Sep 19, 2025 · Updated 6 months ago
- a benchmark to evaluate the situated inductive reasoning ☆15 · Jan 7, 2025 · Updated last year
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training" ☆170 · Oct 20, 2025 · Updated 5 months ago
- [NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett… ☆300 · Nov 16, 2024 · Updated last year
- ProgPrompt for Virtualhome ☆148 · Jun 23, 2023 · Updated 2 years ago
- VisualWebArena is a benchmark for multimodal agents. ☆445 · Nov 9, 2024 · Updated last year
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering" ☆15 · Oct 2, 2025 · Updated 5 months ago
- [ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models ☆218 · Mar 26, 2025 · Updated 11 months ago
- Git for "Stepwise Self-Consistent Mathematical Reasoning with Large Language Models" ☆12 · Nov 26, 2024 · Updated last year
- ✨✨Latest Papers and Datasets on Mobile and PC GUI Agent ☆152 · Nov 29, 2024 · Updated last year
- ICLR 2025 Agent-Related Papers ☆76 · Nov 14, 2024 · Updated last year
- This repository contains the code for the paper: Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language Models ☆20 · Apr 27, 2024 · Updated last year
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024] ☆148 · Nov 26, 2024 · Updated last year
- 🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resource… ☆392 · Feb 17, 2026 · Updated last month
- Official code for the paper: WALL-E: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents ☆58 · Dec 3, 2025 · Updated 3 months ago
- AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback ☆125 · Mar 31, 2025 · Updated 11 months ago