zjunlp/WKM

zjunlp / WKM

[NeurIPS 2024] Agent Planning with World Knowledge Model

☆167

Alternatives and similar repositories for WKM

Users who are interested in WKM are comparing it to the repositories listed below.

Yifan-Song793 / ETO
Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)
☆168 · Oct 30, 2024 · Updated last year
zjunlp / KnowAgent
[NAACL 2025] KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents
☆260 · Jan 29, 2025 · Updated last year
Yu-Fangxu / FoR
[ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples
☆126 · Jan 31, 2026 · Updated 5 months ago
WeiminXiong / IPR
Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)
☆68 · Oct 18, 2024 · Updated last year
WeiminXiong / MPO
MPO: Boosting LLM Agents with Meta Plan Optimization (EMNLP 2025 Findings)
☆81 · Aug 20, 2025 · Updated 11 months ago
WENGSYX / ControlLM
ControlLM is a method to control the personality traits and behaviors of language models in real-time at inference without costly trainin…
☆21 · Nov 6, 2024 · Updated last year
zjunlp / AutoAct
[ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning
☆238 · Jan 13, 2025 · Updated last year
hrwise-nlp / AppBench
This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction
☆16 · Nov 4, 2024 · Updated last year
zjunlp / WorfBench
[ICLR 2025] Benchmarking Agentic Workflow Generation
☆155 · Feb 19, 2025 · Updated last year
nirgreshler / bayesian-online-planning
The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.
☆13 · Jun 17, 2024 · Updated 2 years ago
yjwtheonly / Scorpius
Scorpius: Poisoning scientific knowledge using large language models
☆11 · Aug 3, 2024 · Updated last year
magicgh / Ask-before-Plan
[EMNLP 2024] Ask-before-Plan: Proactive Language Agents for Real-World Planning
☆24 · Jul 28, 2025 · Updated 11 months ago
ChangyuChen347 / MaskedThought
[ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models
☆27 · Jul 9, 2024 · Updated 2 years ago
zjunlp / LightThinker
[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression
☆165 · Jun 22, 2026 · Updated last month
JingyangYi / ShorterBetter
☆18 · Jul 31, 2025 · Updated 11 months ago
allenai / ScienceWorld
ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.
☆368 · Dec 3, 2025 · Updated 7 months ago
karthikv792 / LLMs-Planning
An extensible benchmark for evaluating large language models on planning
☆470 · Jun 2, 2026 · Updated last month
alfworld / alfworld
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
☆810 · Feb 8, 2026 · Updated 5 months ago
ByteDance-Seed / Agent-R
Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"
☆174 · Oct 20, 2025 · Updated 9 months ago
XiaojuanTang / Mars
A benchmark for evaluating situated inductive reasoning
☆16 · Jan 7, 2025 · Updated last year
RishiHazra / saycanpay
Official code release of AAAI 2024 paper SayCanPay.
☆54 · Oct 22, 2025 · Updated 9 months ago
VityaVitalich / STASC
[ICLR 2025 SSI-FM] Self-Taught Self-Correction for Small Language Models
☆11 · Sep 19, 2025 · Updated 10 months ago
youngsoul0731 / FLORA-Bench
[Arxiv 2025] Official code and datasets of paper: GNNs as Predictors of Agentic Workflow Performances
☆20 · Jan 15, 2026 · Updated 6 months ago
1989Ryan / llm-mcts
[NeurIPS 2023] We use large language models as commonsense world model and heuristic policy within Monte-Carlo Tree Search, enabling bett…
☆303 · Nov 16, 2024 · Updated last year
AmourWaltz / UAlign
Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"
☆15 · Mar 25, 2025 · Updated last year
NVlabs / progprompt-vh
ProgPrompt for Virtualhome
☆153 · Jun 23, 2023 · Updated 3 years ago
web-arena-x / visualwebarena
VisualWebArena is a benchmark for multimodal agents.
☆484 · Nov 9, 2024 · Updated last year
chentong0 / rl-binary-rar
Official repo for "Binary Retrieval-augmented Reward Mitigates Hallucinations"
☆15 · Nov 13, 2025 · Updated 8 months ago
zjunlp / WorldMind
Aligning Agentic World Models via Knowledgeable Experience Learning
☆37 · May 15, 2026 · Updated 2 months ago
hanqi-qi / Mirror
☆14 · Feb 26, 2024 · Updated 2 years ago
Aaron617 / ICLR-2025-Submissions-Agent
ICLR 2025 Agent-Related Papers
☆75 · Nov 14, 2024 · Updated last year
OSU-NLP-Group / LLM-Planner
[ICCV'23] LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models
☆229 · Mar 26, 2025 · Updated last year
zhao-zilong / ssc-cot
Git for "Stepwise Self-Consistent Mathematical Reasoning with Large Language Models"
☆12 · Nov 26, 2024 · Updated last year
WooooDyy / AgentGym
Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi…
☆817 · May 30, 2026 · Updated last month
Berkeley-NLP / Agent-Eval-Refine
Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]
☆149 · Nov 26, 2024 · Updated last year
flowersteam / WorldLLM
LLM as World Models using Bayesian inference
☆21 · May 27, 2025 · Updated last year
aialt / awesome-mobile-agents
Latest Papers and Datasets on Mobile and PC GUI Agent
☆159 · Nov 29, 2024 · Updated last year
heaplax / ARMAP
☆29 · Jun 5, 2025 · Updated last year
haotiansun14 / AdaPlanner
AdaPlanner: Language Models for Decision Making via Adaptive Planning from Feedback
☆125 · Mar 31, 2025 · Updated last year