mzf666/MATPO

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mzf666/MATPO)

mzf666 / MATPO

Official implementation of MATPO: Multi-Agent Tool-Integrated Policy Optimization.

☆82

Alternatives and similar repositories for MATPO

Users that are interested in MATPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mzf666 / sparsity-indexed-ode
View on GitHub
Official implementation of ``Neural Pruning via Sparsity-indexed ODE: A Continuous Sparsity Viewpoint"
☆11Jun 15, 2023Updated 3 years ago
pettingllms-ai / PettingLLMs
View on GitHub
[ICLR'26] Stronger-MAS: A RL Framework for multi LLM agent system; [arxiv] MetaAgent-X: End-to-End Reinforcement Learning Automatic Mult…
☆206May 15, 2026Updated 2 months ago
xxyQwQ / CoMAS
View on GitHub
Implementation for the paper "CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards".
☆53Jan 26, 2026Updated 6 months ago
real-absolute-AI / LongRLVR
View on GitHub
[ICLR 2026] LongRLVR: Long-Context Reinforcement Learning Requires Verifiable Context Rewards.
☆19Mar 16, 2026Updated 4 months ago
Hoar012 / TDC-Video
View on GitHub
Official implementation of TDC.
☆15Jul 22, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Infinity-AILab / DeepResearchEval
View on GitHub
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation.
☆142Feb 10, 2026Updated 5 months ago
MiroMindAI / MiroMind-M1
View on GitHub
MiroMind-M1 is a fully open-source series of reasoning language models built on Qwen-2.5, focused on advancing mathematical reasoning.
☆280Aug 12, 2025Updated 11 months ago
EvolvingLMMs-Lab / LongVT
View on GitHub
[CVPR 2026] LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling
☆257Jun 24, 2026Updated last month
Wenchuan-Zhang / Patho-R1
View on GitHub
[AAAI-2026] Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner
☆100Nov 17, 2025Updated 8 months ago
jins7 / LatentEvolve
View on GitHub
☆27Oct 9, 2025Updated 9 months ago
EvolvingLMMs-Lab / OpenMMReasoner
View on GitHub
[CVPR 2026] OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
☆164Mar 30, 2026Updated 3 months ago
syr-cn / ReMemR1
View on GitHub
Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents
☆43Apr 13, 2026Updated 3 months ago
LengSicong / MMR1
View on GitHub
[CVPR 2026] MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources
☆217Sep 26, 2025Updated 10 months ago
LiangThree / MCMA
View on GitHub
☆16Jan 12, 2026Updated 6 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
jwliao-ai / MARFT
View on GitHub
☆86May 14, 2026Updated 2 months ago
UniX-AI-Lab / WorldReasonBench
View on GitHub
WorldReasonBench: Human-Aligned Stress Testing of Video Generators as Future World-State Predictors
☆22May 19, 2026Updated 2 months ago
MiroMindAI / MiroRL
View on GitHub
MiroRL is an MCP-first reinforcement learning framework for deep research agent.
☆246Aug 27, 2025Updated 11 months ago
SLIT-AI / WRPO
View on GitHub
[ICLR 2025] Weighted-Reward Preference Optimization for Implicit Model Fusion
☆14Mar 17, 2025Updated last year
NTU-Siqiang-Group / AsterVec
View on GitHub
Embedded on-device vector database for AI agent memory and local RAG — disk-based HNSW in an LSM-tree, C++/Python, optimized for low memo…
☆32Jul 21, 2026Updated last week
UCSC-VLAA / MedVLSynther
View on GitHub
[ICLR'26] MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs
☆19Nov 1, 2025Updated 8 months ago
Wenchuan-Zhang / Patho-AgenticRAG
View on GitHub
[AAAI-2026] Patho-AgenticRAG: Towards Multimodal Agentic Retrieval-Augmented Generation for Pathology VLMs via Reinforcement Learning
☆64Nov 17, 2025Updated 8 months ago
ZJU-ACES-ISE / ChatUITest
View on GitHub
Under construction
☆14Jan 15, 2025Updated last year
gdmnl / Spectral-GNN-Benchmark
View on GitHub
A PyG-based package of spectral GNNs with benchmark evaluations (SIGMOD 2026).
☆19Aug 20, 2025Updated 11 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
TsinghuaC3I / MARTI
View on GitHub
A Framework for LLM-based Multi-Agent Reinforced Training and Inference
☆540Apr 14, 2026Updated 3 months ago
AQ-MedAI / MrlX
View on GitHub
MrlX: A Multi-Agent Reinforcement Learning Framework
☆215Jan 19, 2026Updated 6 months ago
Alibaba-Quark / SSP
View on GitHub
Search Self-Play: Pushing the Frontier of Agent Capability without Supervision
☆105Updated this week
usail-hkust / Agent-Omit
View on GitHub
Agent-Omit: Training Efficient LLM Agents for Adaptive Thought and Observation Omission via Reinforcement Learning
☆32May 11, 2026Updated 2 months ago
xlyu0106 / VisMem
View on GitHub
☆91Feb 5, 2026Updated 5 months ago
junmokane / spatially-aware-transformer
View on GitHub
☆10Dec 10, 2024Updated last year
YujunZhou / EVOL-RL
View on GitHub
Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).
☆51Mar 31, 2026Updated 3 months ago
UCSC-VLAA / MedVLThinker
View on GitHub
[ML4H'25] MedVLThinker: Simple Baselines for Multimodal Medical Reasoning
☆60Dec 21, 2025Updated 7 months ago
1586951660 / bs
View on GitHub
老年人健康监测系统
☆10Jun 17, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Yingjia-Wan / FaStfact
View on GitHub
Code repo for FaStfact: Faster, Stronger Long-Form Factuality Evaluations in LLMs.
☆33Nov 5, 2025Updated 8 months ago
bigai-nlco / RuleReasoner
View on GitHub
[ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling
☆39Feb 25, 2026Updated 5 months ago
TIGER-AI-Lab / verl-tool
View on GitHub
A version of verl to support diverse tool use [TMLR 2026]
☆1,026Jul 15, 2026Updated 2 weeks ago
qualidea1217 / HiPRAG
View on GitHub
HiPRAG (Hierarchical Process Rewards for Efficient Agentic Retrieval Augmented Generation) is a reinforcement learning method designed fo…
☆26Oct 10, 2025Updated 9 months ago
RUC-NLPIR / ARPO
View on GitHub
[ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)
☆1,093Jul 13, 2026Updated 2 weeks ago
jjovalle99 / Speculative-RAG
View on GitHub
☆33Aug 28, 2024Updated last year
yczhou001 / MAM
View on GitHub
MAM: ModularMulti-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration
☆53Apr 3, 2026Updated 3 months ago