TIGER-AI-Lab/MEGA-Bench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TIGER-AI-Lab/MEGA-Bench)

TIGER-AI-Lab / MEGA-Bench

This repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR 2025]

☆81

Alternatives and similar repositories for MEGA-Bench

Users that are interested in MEGA-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

3dlg-hcvc / SemLayoutDiff
View on GitHub
Official PyTorch implementation of "SemLayoutDiff: Semantic Layout Diffusion for 3D Indoor Scene Generation"
☆33Jun 15, 2026Updated last month
3dlg-hcvc / OPDMulti
View on GitHub
☆23Apr 4, 2026Updated 3 months ago
mingrui-zhao / SweepNet
View on GitHub
☆27Jul 22, 2025Updated 11 months ago
3dlg-hcvc / paris
View on GitHub
[ICCV 2023] Official implementation of the paper "PARIS: Part-level Reconstruction and Motion Analysis for Articulated Objects"
☆86Feb 16, 2025Updated last year
3dlg-hcvc / NuiScene
View on GitHub
[ICCV 2025] NuiScene: Exploring Efficient Generation of Unbounded Outdoor Scenes
☆92Oct 26, 2025Updated 8 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hanhung / TGNN
View on GitHub
☆26Mar 15, 2022Updated 4 years ago
3dlg-hcvc / OPD
View on GitHub
[ECCV 2022, Oral] OPD: Single-view 3D Openable Part Detection
☆37Jul 2, 2026Updated 2 weeks ago
bioscan-ml / clibd
View on GitHub
A multimodal model bridging vision and genomics for biodiversity monitoring at scale.
☆21May 11, 2026Updated 2 months ago
3dlg-hcvc / revsi
View on GitHub
[ICML 2026] ReVSI: Rebuilding Visual Spatial Intelligence Evaluation for Accurate Assessment of VLM 3D Reasoning
☆81Jul 8, 2026Updated last week
TIGER-AI-Lab / VideoGenHub
View on GitHub
A one-stop library to standardize the inference and evaluation of all the conditional video generation models.
☆50Feb 13, 2025Updated last year
multimodal-art-projection / IV-Bench
View on GitHub
☆14Apr 23, 2025Updated last year
JinjieNi / MixEval-X
View on GitHub
The official github repo for MixEval-X, the first any-to-any, real-world benchmark.
☆17Feb 15, 2025Updated last year
3dlg-hcvc / smc
View on GitHub
[3DV 2025] Official implementation of the paper "SceneMotifCoder: Example-driven Visual Program Learning for Generating 3D Object Arrange…
☆47Oct 14, 2025Updated 9 months ago
psunlpgroup / VisOnlyQA
View on GitHub
This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…
☆29Jul 9, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
TIGER-AI-Lab / GenAI-Arena
View on GitHub
Interface for GenAI-Arena [NeurIPS24]
☆16Feb 27, 2024Updated 2 years ago
WangWenhao0716 / PDF-Embedding
View on GitHub
[NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"
☆18Oct 1, 2024Updated last year
CMMMU-Benchmark / CMMMU
View on GitHub
☆48Sep 5, 2024Updated last year
TIGER-AI-Lab / ABC
View on GitHub
ABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]
☆19Aug 21, 2025Updated 10 months ago
TIGER-AI-Lab / AceCoder
View on GitHub
The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]
☆100Apr 9, 2025Updated last year
husterpzh / PSSR
View on GitHub
Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration （CVPR2023）"
☆10May 15, 2024Updated 2 years ago
jdf-prog / LLM-Engines
View on GitHub
☆50Jun 7, 2025Updated last year
UCSC-VLAA / ReasoningEval
View on GitHub
Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains.
☆43Jun 6, 2025Updated last year
TIGER-AI-Lab / VisCoder
View on GitHub
The official code of "VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation" [EMNLP25]
☆19Sep 21, 2025Updated 9 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
swordlidev / Evaluation-Multimodal-LLMs-Survey
View on GitHub
A Survey on Benchmarks of Multimodal Large Language Models
☆156Jul 13, 2026Updated last week
TIGER-AI-Lab / VL-Rethinker
View on GitHub
The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]
☆189Jun 5, 2025Updated last year
OpenGVLab / V2PE
View on GitHub
[ICCV2025] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding
☆60Apr 4, 2026Updated 3 months ago
TIGER-AI-Lab / VISTA
View on GitHub
The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]
☆20Feb 27, 2025Updated last year
3dlg-hcvc / singapo
View on GitHub
[ICLR 2025] SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects
☆95Feb 17, 2026Updated 5 months ago
TIGER-AI-Lab / ImagenHub
View on GitHub
A one-stop library to standardize the inference and evaluation of all the conditional image generation models. [ICLR 2024]
☆179Dec 2, 2025Updated 7 months ago
TIGER-AI-Lab / StructEval
View on GitHub
Evaluating LLMs' abilities to generate structural output [TMLR2025]
☆23Jun 12, 2026Updated last month
yunfeixie233 / ViGaL
View on GitHub
☆70Feb 4, 2026Updated 5 months ago
MMMU-Benchmark / MMMU
View on GitHub
This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for E…
☆589Feb 12, 2026Updated 5 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
mikejqzhang / SituatedQA
View on GitHub
☆23Aug 10, 2022Updated 3 years ago
3dlg-hcvc / tricolo
View on GitHub
[WACV 2024] TriCoLo: Trimodal Contrastive Loss for Text to Shape Retrieval
☆28Jul 13, 2025Updated last year
MME-Benchmarks / MME-RealWorld
View on GitHub
✨✨ [ICLR 2025] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
☆160Oct 21, 2025Updated 9 months ago
kongdai123 / consistency2
View on GitHub
☆16Jun 14, 2024Updated 2 years ago
TIGER-AI-Lab / General-Reasoner
View on GitHub
General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]
☆227Nov 27, 2025Updated 7 months ago
cfeng16 / GPS2Pix
View on GitHub
[CVPR 2025] GPS as a Control Signal for Image Generation
☆25Mar 18, 2025Updated last year
GAIR-NLP / self-improvement-reversal
View on GitHub
☆13Jul 14, 2024Updated 2 years ago