Official implementation of MATPO: Multi-Agent Tool-Integrated Policy Optimization.
☆79Oct 31, 2025Updated 4 months ago
Alternatives and similar repositories for MATPO
Users that are interested in MATPO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》☆29Oct 23, 2025Updated 5 months ago
- Evaluating Durability: Benchmark Insights into Multimodal Watermarking☆12Jun 7, 2024Updated last year
- ☆18Aug 19, 2025Updated 7 months ago
- Official Implementation of wd1☆24Sep 25, 2025Updated 6 months ago
- [CVPR 2026] LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling☆210Mar 9, 2026Updated 2 weeks ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- [ICLR 2026] Meta-RL Induces Exploration in Language Agents☆33Feb 1, 2026Updated last month
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- [ICLR 2025] Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs☆19Mar 20, 2025Updated last year
- [AAAI26] Trade-offs in Large Reasoning Models: An Empirical Analysis of Deliberative and Adaptive Reasoning over Foundational Capabilitie…☆10Feb 7, 2026Updated last month
- The official implemention of "Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration"☆24Feb 4, 2026Updated last month
- The CodeInsight dataset is designed for code generation tasks, providing developers with expert-curated examples that bridge the gap betw…☆14Oct 22, 2024Updated last year
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆48Oct 16, 2025Updated 5 months ago
- Finetune GPT2 for text summarization☆17Aug 16, 2021Updated 4 years ago
- MiroRL is an MCP-first reinforcement learning framework for deep research agent.☆243Aug 27, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆17Jul 12, 2025Updated 8 months ago
- STAR: Similarity-guided Teacher-Assisted Refinement for Super-Tiny Function Calling Models☆42Updated this week
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Jul 22, 2021Updated 4 years ago
- Code and data for paper "Large language models can rate news outlet credibility"☆13Aug 10, 2024Updated last year
- open IPython notebooks for the book of Scientific Computing with Python☆11Jul 16, 2015Updated 10 years ago
- ☆15Feb 26, 2025Updated last year
- Code and data for paper "Can LLM Watermarks Robustly Prevent Unauthorized Knowledge Distillation?". (ACL 2025 Main)☆22Jun 18, 2025Updated 9 months ago
- [NeurIPS 2025] Scaling Language-centric Omnimodal Representation Learning☆38Feb 6, 2026Updated last month
- ☆11Mar 13, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆10Apr 24, 2022Updated 3 years ago
- ☆18Mar 30, 2025Updated 11 months ago
- A Comprehensive Review of 3D Object Detection in Autonomous Driving: Technological Advances and Future Directions☆34Sep 3, 2024Updated last year
- Towards Scalable Language-Image Pre-training for 3D Medical Imaging [TMLR 2026]☆45Updated this week
- ☆10Jun 21, 2021Updated 4 years ago
- A dataset and CLIP baseline for unrepresentative news thumbnail detection (ACL 2022 workshop)☆12May 26, 2022Updated 3 years ago
- [ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models".☆24Mar 16, 2025Updated last year
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆68Mar 20, 2026Updated last week
- personalized product search with product reviews☆17Feb 1, 2023Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- This is the source code of IJCNN 2023 paper TieFake: Title-Text Similarity and Emotion-Aware Fake News Detection (TieFake).☆16Dec 21, 2023Updated 2 years ago
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO☆81Oct 29, 2025Updated 4 months ago
- Official implementation for "ALI-Agent: Assessing LLMs'Alignment with Human Values via Agent-based Evaluation"☆21Jan 31, 2026Updated last month
- official implementation of RoSAS: Deep Semi-supervised Anomaly Detection with Contamination-resilient Continuous Supervision☆11Jul 18, 2023Updated 2 years ago
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆107Sep 18, 2025Updated 6 months ago
- 🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to…☆55Aug 28, 2025Updated 7 months ago
- Convex Formulation of Multiple Instance Learning from Positive and Unlabeled Bags☆10Apr 28, 2018Updated 7 years ago