[ICML 2025 Oral] This is the official repository of the paper "What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities"
☆22Jun 12, 2025Updated 9 months ago
Alternatives and similar repositories for OmniBench
Users that are interested in OmniBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆27Jul 23, 2025Updated 8 months ago
- 这是一个爬取雪球网选股策略并保持到MySQL的小程序。☆15Aug 16, 2017Updated 8 years ago
- 机器学习乐园:主要包括机器学习基础,深度学习实践,工业应用。☆15Nov 14, 2022Updated 3 years ago
- [ICLR 2026] "VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?", Yuanxin Liu, Kun Ouyang, Haoning Wu, Yi Liu, L…☆37Jan 30, 2026Updated last month
- Interpreting CLIP with Hierarchical Sparse Autoencoders (ICML 2025)☆24Jan 17, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 浙江大学PAT题解☆18Sep 2, 2024Updated last year
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.☆33Jul 21, 2023Updated 2 years ago
- Indoor-Navigation-System based on SLAM & AR☆15May 27, 2020Updated 5 years ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆22Nov 1, 2025Updated 4 months ago
- 基于stm32 1602显示方式4轮智能小车例程☆15Feb 23, 2016Updated 10 years ago
- ☆12Nov 2, 2025Updated 4 months ago
- 【计算机网络课程作业】网络聊天室,涵盖了基本的socket网络编程、Tkinter图像化界面、MySQL数据库等技术,可实现表情包的发送、单用户私聊、机器人对话等功能☆10Nov 4, 2025Updated 4 months ago
- Package to align tokens from different tokenizations.☆16Mar 25, 2024Updated 2 years ago
- A complete introductory course to programming, computer systems and software development (continuously updating).☆12Feb 21, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ssm学习项目,物业管理系统,分管理后台和用户后台☆17Apr 1, 2019Updated 6 years ago
- ☆29Sep 4, 2025Updated 6 months ago
- 机器人学导论代码仓库☆14Jul 8, 2020Updated 5 years ago
- ☆11Apr 5, 2020Updated 5 years ago
- [ICCV 2025] Official repository of "Mitigating Object Hallucinations via Sentence-Level Early Intervention".☆30Feb 12, 2026Updated last month
- NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)☆50Jan 30, 2024Updated 2 years ago
- A simple Google Search Engine Crawler.☆21Feb 16, 2024Updated 2 years ago
- ☆24Nov 20, 2025Updated 4 months ago
- A python3 GUI for famous antivirus clamav.☆19Jun 28, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆12Nov 1, 2025Updated 4 months ago
- ☆14May 3, 2022Updated 3 years ago
- This is the official repository of Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities☆39Mar 11, 2026Updated 2 weeks ago
- ☆18Jul 25, 2025Updated 8 months ago
- 基于STM32的车牌识别系统☆14Mar 16, 2018Updated 8 years ago
- 📑一键将当前打开 的PPT的页面导出为PDF,并自动裁剪白边。非常适合科研人员在使用PPT作图时,修改PPT后快速导出PDF插入到论文中。 One click to export the currently open PPT page as a PDF and automa…☆26Nov 24, 2025Updated 4 months ago
- The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"☆87Oct 15, 2025Updated 5 months ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆21Jul 16, 2025Updated 8 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- IS416 Final Project. A PoW-based blockchain implementation with attackers trying fork attacks. Language: Go.☆17Jan 10, 2022Updated 4 years ago
- UniVid: The Open-Source Unified Video Model☆30Oct 13, 2025Updated 5 months ago
- Dedicated to building industrial foundation models for universal data intelligence across industries.☆62Aug 19, 2024Updated last year
- ☆13Nov 5, 2024Updated last year
- [CVPR 2026] WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning☆61Mar 5, 2026Updated 2 weeks ago
- ☆35Jan 9, 2026Updated 2 months ago
- Official Implementation for paper "Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm"☆21Mar 18, 2026Updated last week