☆48Sep 5, 2024Updated last year
Alternatives and similar repositories for CMMMU
Users that are interested in CMMMU are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Accelerating the development of large multimodal models (LMMs) with lmms-eval☆14Oct 14, 2024Updated last year
- ☆56Mar 19, 2025Updated last year
- This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for E…☆569Feb 12, 2026Updated 3 months ago
- Code and data for EMNLP 2023 paper "Grounding Visual Illusions in Language: Do Vision-Language Models Perceive Illusions Like Humans?"☆15Jan 25, 2024Updated 2 years ago
- [ECCV 2024] "REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models"☆13Aug 6, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official repo for StableLLAVA☆95Dec 22, 2023Updated 2 years ago
- ☆39Aug 9, 2022Updated 3 years ago
- ☆19Aug 3, 2024Updated last year
- 学术主页 | Academic Page☆14May 7, 2026Updated last week
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆19Apr 16, 2024Updated 2 years ago
- Resources for paper "DialSummEval: Revisiting summarization evaluation for dialogues"☆14Jul 22, 2025Updated 9 months ago
- [ICCV2023] NoiseDet: Learning from Noisy Data for Semi-Superivsed 3D Object Detection☆20Feb 5, 2023Updated 3 years ago
- [KDD'22] Partial Label Learning with Discrimination Augmentation☆10May 21, 2024Updated last year
- ☆128Jul 29, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆19Dec 6, 2023Updated 2 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 3 years ago
- ☆36Sep 6, 2024Updated last year
- MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)☆326Jan 20, 2025Updated last year
- Official github repo of G-LLaVA☆149Feb 20, 2025Updated last year
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.☆32Feb 26, 2025Updated last year
- 一个mmcv 的logger hook, 可以用来把模型结果推送到微信上☆21Oct 11, 2022Updated 3 years ago
- [ICLR2026] VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling☆523Nov 18, 2025Updated 6 months ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆17Jun 20, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…☆64May 15, 2025Updated last year
- Implementation for "The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer"☆83Oct 29, 2025Updated 6 months ago
- VeighNa框架的LevelDB数据库接口☆13Apr 23, 2023Updated 3 years ago
- Codebase for EnterpriseOps-Gym from ServiceNow☆91May 8, 2026Updated last week
- ☆27Jan 23, 2024Updated 2 years ago
- [ICML 2024] | MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI☆119Apr 6, 2026Updated last month
- [ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning☆297Mar 13, 2024Updated 2 years ago
- Extending Conformal Prediction to LLMs☆70Jun 21, 2024Updated last year
- [ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of …☆508Aug 9, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆138May 6, 2025Updated last year
- [ICML 2025] This is the official PyTorch implementation of "OmniBal: Towards Fast Instruction-Tuning for Vision-Language Models via Omniv…☆27Jun 16, 2025Updated 11 months ago
- This is for C2D2 Dataset: A Resource for Analyzing Cognitive Distortions and Its Impact on Mental Health☆34Nov 10, 2023Updated 2 years ago
- ☆30Feb 10, 2025Updated last year
- ☆17Feb 22, 2024Updated 2 years ago
- ☆42Jul 15, 2025Updated 10 months ago
- Geometric Problem Solving Integrating FormalGeo Symbolic System and Hypergraph Neural Network.☆15Sep 23, 2025Updated 7 months ago