MAT: Multi-modal Agent Tuning 🔥 ICLR 2025 (Spotlight)
☆93Dec 18, 2025Updated 5 months ago
Alternatives and similar repositories for MAT-Agent
Users that are interested in MAT-Agent are comparing it to the libraries listed below.
- ☆13Nov 5, 2024Updated last year
- [ICLR 2025 🔥] MMKE-Bench, a challenging benchmark for evaluating diverse semantic editing in real-world scenarios.☆23Apr 19, 2025Updated last year
- ☆68Dec 5, 2025Updated 6 months ago
- ☆30Jun 19, 2024Updated last year
- [AAAI 2026] Release of code, datasets and model for our work TongUI: Internet-Scale Trajectories from Multimodal Web Tutorials for General…☆111Dec 1, 2025Updated 6 months ago
- A powerful automation agent for macOS that enables natural language control of various system applications and services. This agent allow…☆60Jun 5, 2025Updated last year
- ☆55Oct 3, 2024Updated last year
- Code for ACM MM 2024 paper "A Picture Is Worth a Graph: A Blueprint Debate Paradigm for Multimodal Reasoning"☆19Dec 5, 2024Updated last year
- ☆22May 23, 2025Updated last year
- ICLR 2026: Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks☆42Apr 28, 2026Updated last month
- Analyzing LLM Alignment via Token distribution shift☆18Jan 26, 2024Updated 2 years ago
- (ICLR 2025 Spotlight) Official code repository for Interleaved Scene Graph.☆32Aug 7, 2025Updated 10 months ago
- ☆78Apr 15, 2026Updated last month
- Lab tasks for the course on "Data Engineering for Machine Learning"☆10May 1, 2023Updated 3 years ago
- ☆17Nov 1, 2024Updated last year
- A derivative of the open-source software Anki that simplifies some operations; a "foolproof" English-learning app☆15Dec 8, 2022Updated 3 years ago
- ☆69Jun 2, 2026Updated last week
- ☆47Aug 26, 2025Updated 9 months ago
- MemoryEQA☆26May 4, 2026Updated last month
- General-purpose Visual Understanding Evaluation☆20Dec 21, 2023Updated 2 years ago
- Code for the CVPR 2020 paper "Learning to Optimize on SPD Manifolds"☆13Sep 12, 2020Updated 5 years ago
- ☆10Nov 21, 2023Updated 2 years ago
- R1-Vision: Let's first take a look at the image☆48Feb 16, 2025Updated last year
- ☆17Apr 19, 2021Updated 5 years ago
- A reproduction of the paper "Continual Vision-Language Representation Learning with Off-Diagonal Information" (Mod-X).☆12Oct 31, 2023Updated 2 years ago
- [MTI-LLM@NeurIPS 2025] Official implementation of "PyVision: Agentic Vision with Dynamic Tooling."☆161Jul 22, 2025Updated 10 months ago
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆32Jun 5, 2025Updated last year
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…☆1,480Mar 9, 2026Updated 3 months ago
- [ICLR 2025] "Noisy Test-Time Adaptation in Vision-Language Models"☆16Feb 22, 2025Updated last year
- ☆49Oct 28, 2025Updated 7 months ago
- ☆129Jul 22, 2025Updated 10 months ago
- ☆23Aug 30, 2025Updated 9 months ago
- Reasoning Activation in LLMs via Small Model Transfer (NeurIPS 2025)☆22Oct 16, 2025Updated 7 months ago
- ReproZip for the Preservation of Web Applications☆17May 6, 2024Updated 2 years ago
- Framework of DataLog Neural Program Synthesis☆27Apr 2, 2019Updated 7 years ago
- [TPAMI 2025, Highly Cited Paper] Divide-and-Conquer: Confluent Triple-Flow Network for RGB-T Salient Object Detection☆22Jul 10, 2025Updated 11 months ago
- ☆14Jun 21, 2019Updated 6 years ago
- Code for COLING 2020 paper "Controllable Abstractive Sentence Summarization with Guiding Entities"☆12Dec 24, 2020Updated 5 years ago