WeiminXiong/MPO

MPO: Boosting LLM Agents with Meta Plan Optimization (EMNLP 2025 Findings)

☆81

Alternatives and similar repositories for MPO

Users who are interested in MPO are comparing it to the libraries listed below.

Yifan-Song793 / GoodBadGreedy
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism
☆31 · Jul 17, 2024 · Updated 2 years ago
PKU-TANGENT / ConFiguRe
Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"
☆12 · Jul 27, 2023 · Updated 3 years ago
WeiminXiong / RationaleCL
Rationale-enhanced language models are better continual relation learners (EMNLP 2023 Main Conference)
☆12 · Oct 11, 2023 · Updated 2 years ago
WeiminXiong / IPR
Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)
☆68 · Oct 18, 2024 · Updated last year
Yifan-Song793 / ETO
Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)
☆168 · Oct 30, 2024 · Updated last year
F2-Song / ICDPO
The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization…
☆16 · Feb 15, 2024 · Updated 2 years ago
F2-Song / Weak-to-Strong-Decoding
The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"
☆22 · Jun 26, 2025 · Updated last year
chenllliang / MMEvalPro
[NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs
☆25 · Sep 26, 2024 · Updated last year
RenShuhuai-Andy / my-tools
my commonly-used tools
☆64 · Jan 7, 2025 · Updated last year
KbsdJames / Omni-MATH
The official repository of the Omni-MATH benchmark.
☆94 · Dec 22, 2024 · Updated last year
zjunlp / WKM
[NeurIPS 2024] Agent Planning with World Knowledge Model
☆167 · Dec 17, 2024 · Updated last year
Zce1112zslx / ChID_baseline
Baseline implementation for the Computational Linguistics course project, fall semester of the 2022-23 academic year
☆38 · Dec 8, 2022 · Updated 3 years ago
FranxYao / Retrieval-Head-with-Flash-Attention
Efficient retrieval head analysis with triton flash attention that supports topK probability
☆13 · Jun 15, 2024 · Updated 2 years ago
lancopku / clip-openness
[ACL 2023] Delving into the Openness of CLIP
☆24 · Jan 11, 2023 · Updated 3 years ago
KbsdJames / omni-math-rule
The rule-based evaluation subset and code implementation of Omni-MATH
☆28 · Dec 23, 2024 · Updated last year
Lux0926 / ASPRM
AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence
☆10 · Mar 2, 2025 · Updated last year
ByteDance-Seed / Agent-R
Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"
☆174 · Oct 20, 2025 · Updated 9 months ago
dqxiu / CaliNet
☆32 · Oct 17, 2022 · Updated 3 years ago
pkunlp-icler / PCA-EVAL
[ACL 2024] PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain
☆107 · Mar 14, 2024 · Updated 2 years ago
Wangpeiyi9979 / ESD
Code for NAACL2022 Long Paper "An Enhanced Span-based Decomposition Method for Few-Shot Sequence Labeling"
☆27 · Nov 9, 2022 · Updated 3 years ago
ai-nikolai / StateAct
[REALM25 @ ACL25] - "StateAct" Official Paper Repo (SOTA LLM Agent)
☆18 · Feb 27, 2026 · Updated 5 months ago
pkunlp-icler / SCL-RAI
Code for "SCL-RAI: Span-based Contrastive Learning with Retrieval Augmented Inference for Unlabeled Entity Problem in NER" @COLING-2022
☆11 · Aug 20, 2022 · Updated 3 years ago
KbsdJames / Awesome-LLM-Preference-Learning
The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"
☆192 · Oct 28, 2024 · Updated last year
Wangpeiyi9979 / HCL-Text2AMR
Code for ACL22 short paper "Hierarchical Curriculum Learning for AMR Parsing"
☆13 · Jun 1, 2022 · Updated 4 years ago
SparkJiao / dpo-trajectory-reasoning
[EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".
☆84 · Jan 14, 2025 · Updated last year
pkunlp-icler / MLS
Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022
☆13 · Apr 13, 2022 · Updated 4 years ago
lancopku / MUKI
[Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models
☆19 · Mar 16, 2023 · Updated 3 years ago
M3-IT / YING-VLM
Vision Large Language Models trained on M3IT instruction tuning dataset
☆17 · Aug 16, 2023 · Updated 2 years ago
YucanGuo / RouteRAG
RouteRAG: Efficient Retrieval-Augmented Generation from Text and Graph via Reinforcement Learning
☆36 · Jul 1, 2026 · Updated 3 weeks ago
XiaoMi / DetermLR
Open source code for paper
☆14 · May 27, 2024 · Updated 2 years ago
chenllliang / ParetoMNMT
Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023
☆17 · Sep 27, 2023 · Updated 2 years ago
dwzhu-pku / PoSE
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extreme Length (ICLR 2024)
☆208 · May 20, 2024 · Updated 2 years ago
LCLM-Horizon / A-Comprehensive-Survey-For-Long-Context-Language-Modeling
A Comprehensive Survey on Long Context Language Modeling
☆252 · May 29, 2026 · Updated 2 months ago
Koreyoshi01 / VISD
This repository is the official implementation for VISD.
☆22 · May 17, 2026 · Updated 2 months ago
PKU-AICare / ConfAgents
ConfAgents: A Conformal-Guided Multi-Agent Framework for Cost-Efficient Medical Diagnosis
☆15 · Jul 22, 2026 · Updated last week
Mr-Loevan / FAST
[NeurIPS 2025 Spotlight] Fast-Slow Thinking GRPO for Large Vision-Language Model Reasoning
☆55 · Apr 16, 2026 · Updated 3 months ago
chenyiqun / MMOA-RAG
This is the code of MMOA-RAG.
☆113 · May 11, 2025 · Updated last year
T-Lab-CUHKSZ / G2RPO-A
[ACL 2026] G2RPO-A: Guided Group Relative Policy Optimization with Adaptive Guidance
☆16 · May 20, 2026 · Updated 2 months ago
heaplax / ARMAP
☆29 · Jun 5, 2025 · Updated last year