Official implementation of MATPO: Multi-Agent Tool-Integrated Policy Optimization.
☆79Oct 31, 2025Updated 5 months ago
Alternatives and similar repositories for MATPO
Users that are interested in MATPO are comparing it to the libraries listed below.
- ☆25Oct 9, 2025Updated 6 months ago
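- This plugin includes some fun Word utilities, such as planning presentation time, extracting the original images from a Word document, convenient API translation, and GPT for Word.☆11Mar 13, 2025Updated last year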
- Official Implementation of wd1☆26Sep 25, 2025Updated 6 months ago
- CoCoFL: Communication- and Computation-Aware Federated Learning via Partial NN Freezing and Quantization☆13Aug 3, 2024Updated last year
- [CVPR 2026] LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling☆217Apr 10, 2026Updated last week
- [CVPR 2026] OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe☆156Mar 30, 2026Updated 2 weeks ago
- The Official Repo for Paper: Aligning Clinical Needs and AI Capabilities: A Survey on LLMs for Medical Reasoning☆22Apr 7, 2026Updated last week
- [ICLR 2026] Meta-RL Induces Exploration in Language Agents☆35Feb 1, 2026Updated 2 months ago
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- ☆36Apr 9, 2026Updated last week
- Code for the "Long Context Needs Some R&R" paper.☆12Mar 11, 2024Updated 2 years ago
- The official implementation of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration"☆24Feb 4, 2026Updated 2 months ago
- [CVPRW 2024] Conformal prediction for uncertainty quantification in image segmentation☆26Dec 9, 2024Updated last year
- ☆12Oct 9, 2024Updated last year
- EANN(Pytorch)☆10Mar 12, 2022Updated 4 years ago
- This repository contains the code for the paper: Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language Models☆21Apr 27, 2024Updated last year
- MiroRL is an MCP-first reinforcement learning framework for deep research agent.☆244Aug 27, 2025Updated 7 months ago
- ☆17Jul 12, 2025Updated 9 months ago
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆42Mar 23, 2026Updated 3 weeks ago
- Game 2048 By HTML5☆11Mar 13, 2015Updated 11 years ago
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Jul 22, 2021Updated 4 years ago
- [MICCAI 2024] MoRA: LoRA Guided Multi-Modal Disease Diagnosis with Missing Modality☆12Sep 26, 2025Updated 6 months ago
- Code and data for paper "Large language models can rate news outlet credibility"☆13Aug 10, 2024Updated last year
- Official Github of "Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework"☆17Jan 4, 2026Updated 3 months ago
- A Paper List for Geo-localization Research☆16Sep 2, 2024Updated last year
- ☆11Mar 13, 2023Updated 3 years ago
- ☆10Apr 24, 2022Updated 3 years ago
- ☆18Mar 30, 2025Updated last year
- Standardizing environment infrastructure with Strands Agents — step, observe, reward.☆44Updated this week
- [ICLR 2026] Quantile Advantage Estimation for Entropy-Safe Reasoning☆24Oct 14, 2025Updated 6 months ago
- ☆10Jun 21, 2021Updated 4 years ago
- Optimization Case Studies: Generic Time Scheduling Problem (GTSP), Resource-Constrained Project Scheduling Problem (RCPSP) with Pulse Var…☆11Nov 7, 2018Updated 7 years ago
- Towards Scalable Language-Image Pre-training for 3D Medical Imaging [TMLR 2026]☆48Apr 2, 2026Updated 2 weeks ago
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆68Apr 4, 2026Updated last week
- personalized product search with product reviews☆17Feb 1, 2023Updated 3 years ago
- This is the source code of IJCNN 2023 paper TieFake: Title-Text Similarity and Emotion-Aware Fake News Detection (TieFake).☆16Dec 21, 2023Updated 2 years ago
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO☆81Oct 29, 2025Updated 5 months ago
- official implementation of RoSAS: Deep Semi-supervised Anomaly Detection with Contamination-resilient Continuous Supervision☆12Jul 18, 2023Updated 2 years ago
- Official implementation for "ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation"☆21Jan 31, 2026Updated 2 months ago