antgroup/OmniBench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/antgroup/OmniBench)

antgroup / OmniBench

[ICML 2025 Oral] This is the official repository of the paper "What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities"

☆22

Alternatives and similar repositories for OmniBench

Users that are interested in OmniBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Lliar-liar / Daily-Omni
View on GitHub
This is the official repository of Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities
☆42Apr 28, 2026Updated 2 months ago
BRZ911 / ViTCoT
View on GitHub
[ACM MM 2025] ViTCoT: Video-Text Interleaved Chain-of-Thought for Boosting Video Understanding in Large Language Models
☆18Jul 15, 2025Updated last year
gvanhorn38 / iNatSounds
View on GitHub
iNatSounds Dataset
☆21Oct 30, 2024Updated last year
sen-ye / PKU-CSSummerCamp-OJ
View on GitHub
classification and solutions for PKU-CSSummerCamp-OnlineJudge
☆26Jul 1, 2023Updated 3 years ago
ttgeng233 / LongVALE
View on GitHub
LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos. (CVPR 2025))
☆61Jun 9, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Exgc / R1V-Free
View on GitHub
R1V, trained with AI feedback, answers open-ended visual questions.
☆14Apr 12, 2025Updated last year
hwanyu112 / VIBE-Benchmark
View on GitHub
☆27Feb 3, 2026Updated 5 months ago
vl-rewardbench / VL_RewardBench
View on GitHub
☆29Jul 23, 2025Updated last year
malthee / evolutionary-diffusion
View on GitHub
Applying Evolutionary Computing to Embeddings of Diffusion Models
☆16Jun 6, 2026Updated last month
showlab / GUI-Narrator
View on GitHub
Repository of GUI Action Narrator
☆13Apr 8, 2025Updated last year
LINs-lab / GMem
View on GitHub
[Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models
☆43Mar 11, 2025Updated last year
llyx97 / video_reason_bench
View on GitHub
[ICLR 2026] "VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?", Yuanxin Liu, Kun Ouyang, Haoning Wu, Yi Liu, L…
☆41Jan 30, 2026Updated 5 months ago
GraphPKU / number_cookbook
View on GitHub
Official repository for the paper Number Cookbook: Number Understanding of Language Models and How to Improve It.
☆21Mar 31, 2025Updated last year
ALT-JS / OthelloSAE
View on GitHub
CS194-196 Course Project
☆14Feb 20, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
fansunqi / AKeyS
View on GitHub
Agentic Keyframe Search for Video Question Answering
☆18Jun 30, 2026Updated 3 weeks ago
MM-FIRE / FIRE
View on GitHub
☆13Nov 5, 2024Updated last year
microsoft / Industrial-Foundation-Models
View on GitHub
Dedicated to building industrial foundation models for universal data intelligence across industries.
☆63Aug 19, 2024Updated last year
jszheng21 / RACE
View on GitHub
RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.
☆14Oct 12, 2024Updated last year
symanto-research / merge-tokenizers
View on GitHub
Package to align tokens from different tokenizations.
☆16Mar 25, 2024Updated 2 years ago
LyWang12 / CUTI-Domain
View on GitHub
☆15Feb 11, 2025Updated last year
mbzuai-oryx / LongShOT
View on GitHub
A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos
☆21Jun 20, 2026Updated last month
aimagelab / COGT
View on GitHub
[ICLR 2025] Causal Graphical Models for Vision-Language Compositional Understanding
☆10Apr 15, 2025Updated last year
SII-dannyXSC / Human2Robot
View on GitHub
AAAI 2026 Oral
☆18Dec 23, 2025Updated 7 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
DUTIR-YSQ / MultiMM
View on GitHub
☆21Dec 7, 2025Updated 7 months ago
yulonghui / MOCA
View on GitHub
Official implementation of "Continual Learning by Modeling Intra-Class Variation" (MOCA). [TMLR 2023]
☆16Mar 3, 2023Updated 3 years ago
Yxxxb / VoCo-LLaMA
View on GitHub
[CVPR'2025] VoCo-LLaMA: This repo is the official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models".
☆205Jun 18, 2025Updated last year
mlsys-io / helium_demo
View on GitHub
☆23May 2, 2026Updated 2 months ago
Parker-rfu / SeLaReasoning
View on GitHub
[ACL 2026 oral] SeLaR: Selective Latent Reasoning in Large Language Models
☆21Apr 25, 2026Updated 3 months ago
Gary-code / KECVQG
View on GitHub
[ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"
☆10Sep 3, 2024Updated last year
huawei-lin / Agent-Omni
View on GitHub
The official implementation for the paper "Agent-Omni: Test-Time Multimodal Reasoning via Model Coordination for Understanding Anything".
☆23Nov 5, 2025Updated 8 months ago
zjr2000 / REVERIE
View on GitHub
[ECCV2024] Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models
☆20Jul 17, 2024Updated 2 years ago
THUDM / SceneGenAgent
View on GitHub
[ACL 2025 Main] SceneGenAgent: Precise Industrial Scene Generation with Coding Agent
☆37Nov 29, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
M3RG-IITD / MaScQA
View on GitHub
☆18Jul 25, 2025Updated last year
WolodjaZ / MSAE
View on GitHub
Interpreting CLIP with Hierarchical Sparse Autoencoders (ICML 2025)
☆28Jan 17, 2026Updated 6 months ago
PlusLabNLP / VISCO
View on GitHub
[CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning
☆13Jun 7, 2025Updated last year
NgCafai / Transformer
View on GitHub
Transformer: PyTorch Implementation of "Attention Is All You Need"
☆15Dec 13, 2023Updated 2 years ago
THUNLP-MT / EscapeCraft
View on GitHub
Official repo for EscapeCraft (an 3D environment for room escape) and benchmark MM-Escape. This work is accepted by ICCV 2025.
☆39Jul 7, 2025Updated last year
yonseivnl / vlm-rlaif
View on GitHub
ACL'24 (Oral) Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback
☆77Sep 12, 2024Updated last year
perronea / 3D_CycleGAN
View on GitHub
GAN for image-to-image translation of 3D T1w and T2w anatomical MR images
☆17Nov 22, 2022Updated 3 years ago