[ICML 2025 Oral] This is the official repository of the paper "What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities"
☆22Jun 12, 2025Updated 10 months ago
Alternatives and similar repositories for OmniBench
Users that are interested in OmniBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆28Jul 23, 2025Updated 8 months ago
- 这是一个爬取雪球网选股策略并保持到MySQL的小程序。☆15Aug 16, 2017Updated 8 years ago
- 机器学习乐园:主要包括机器学习基础,深度学习实践,工业应用。☆15Nov 14, 2022Updated 3 years ago
- [ICLR 2026] "VideoReasonBench: Can MLLMs Perform Vision-Centric Complex Video Reasoning?", Yuanxin Liu, Kun Ouyang, Haoning Wu, Yi Liu, L…☆37Jan 30, 2026Updated 2 months ago
- Interpreting CLIP with Hierarchical Sparse Autoencoders (ICML 2025)☆25Jan 17, 2026Updated 2 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- 浙江大学PAT题解☆18Sep 2, 2024Updated last year
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.☆33Jul 21, 2023Updated 2 years ago
- Indoor-Navigation-System based on SLAM & AR☆15May 27, 2020Updated 5 years ago
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆23Nov 1, 2025Updated 5 months ago
- 基于stm32 1602显示方式4轮智能小车例程☆15Feb 23, 2016Updated 10 years ago
- ☆13Nov 2, 2025Updated 5 months ago
- 【计算机网络课程作业】网络聊天室,涵盖了基本的socket网络编程、Tkinter图像化界 面、MySQL数据库等技术,可实现表情包的发送、单用户私聊、机器人对话等功能☆10Nov 4, 2025Updated 5 months ago
- Package to align tokens from different tokenizations.☆16Mar 25, 2024Updated 2 years ago
- A complete introductory course to programming, computer systems and software development (continuously updating).☆12Feb 21, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ssm学习项目,物业管理系统,分管理后台和用户后台☆17Apr 1, 2019Updated 7 years ago
- ☆29Sep 4, 2025Updated 7 months ago
- 机器人学导论代码仓库☆14Jul 8, 2020Updated 5 years ago
- ☆11Apr 5, 2020Updated 6 years ago
- [ICCV 2025] Official repository of "Mitigating Object Hallucinations via Sentence-Level Early Intervention".☆30Feb 12, 2026Updated 2 months ago
- NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)☆50Jan 30, 2024Updated 2 years ago
- A simple Google Search Engine Crawler.☆22Feb 16, 2024Updated 2 years ago
- A python3 GUI for famous antivirus clamav.☆19Jun 28, 2021Updated 4 years ago
- Extending context length of visual language models☆12Dec 18, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆13Nov 1, 2025Updated 5 months ago
- This is the official repository of Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities☆40Mar 25, 2026Updated 2 weeks ago
- ☆18Jul 25, 2025Updated 8 months ago
- 基于STM32的车牌识别系统☆14Mar 16, 2018Updated 8 years ago
- ☆24Nov 20, 2025Updated 4 months ago
- 📑一键将当前打开的PPT的页面导出为PDF,并自动裁剪白边。非常适合科研人员在使用PPT作图时,修改PPT后快速导出PDF插入到论文中。 One click to export the currently open PPT page as a PDF and automa…☆27Apr 1, 2026Updated 2 weeks ago
- The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"☆90Oct 15, 2025Updated 5 months ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆23Jul 16, 2025Updated 8 months ago
- IS416 Final Project. A PoW-based blockchain implementation with attackers trying fork attacks. Language: Go.☆17Jan 10, 2022Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- UniVid: The Open-Source Unified Video Model☆31Oct 13, 2025Updated 6 months ago
- Dedicated to building industrial foundation models for universal data intelligence across industries.☆61Aug 19, 2024Updated last year
- ☆13Nov 5, 2024Updated last year
- [CVPR 2026] WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning☆70Mar 25, 2026Updated 2 weeks ago
- ☆36Jan 9, 2026Updated 3 months ago
- Official Implementation for paper "Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm"☆23Mar 18, 2026Updated 3 weeks ago
- ☆14May 3, 2022Updated 3 years ago