πΎ E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding (NeurIPS 2024)
β74Jan 20, 2025Updated last year
Alternatives and similar repositories for ETBench
Users that are interested in ETBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequencesβ44Mar 11, 2025Updated last year
- [CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selectionβ141Jul 28, 2025Updated 10 months ago
- β32Jul 29, 2024Updated last year
- [NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attentionβ66Aug 30, 2025Updated 9 months ago
- β81Nov 24, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [CVPR 2025] LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understandingβ83Jul 4, 2025Updated 11 months ago
- [CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMsβ138Apr 27, 2026Updated last month
- π§ VideoMind: A Chain-of-LoRA Agent for Temporal-Grounded Video Reasoning (ICLR 2026)β340Feb 8, 2026Updated 4 months ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.β21Jul 10, 2025Updated 10 months ago
- Data release for Step Differences in Instructional Video (CVPR24)β14Jun 19, 2024Updated last year
- [CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".β296Jun 13, 2024Updated last year
- TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of uβ¦β26Jun 4, 2025Updated last year
- β13Apr 13, 2026Updated last month
- [NeurIPS 2023] Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentationβ20Jan 3, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- β18Jul 10, 2024Updated last year
- UniMD: Towards Unifying Moment retrieval and temporal action Detectionβ57Jul 5, 2024Updated last year
- β31Nov 17, 2024Updated last year
- A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Abilityβ107Nov 28, 2024Updated last year
- [ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, β¦β131Apr 4, 2025Updated last year
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentationβ36Feb 28, 2026Updated 3 months ago
- Official implementation of paper VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interactβ¦β44Feb 5, 2025Updated last year
- [ICCV 2025] Dynamic-VLMβ28Dec 16, 2024Updated last year
- The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)β32Mar 29, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [EMNLP 2025 Findings] Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Modelsβ147Aug 21, 2025Updated 9 months ago
- [EMNLP 2025 Main] The official repo of MMLU-ProX benchmark.β29Aug 26, 2025Updated 9 months ago
- β14Oct 30, 2023Updated 2 years ago
- Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"β95Mar 9, 2025Updated last year
- Official PyTorch code of GroundVQA (CVPR'24)β63Sep 13, 2024Updated last year
- Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understandingβ292Aug 5, 2025Updated 10 months ago
- β19Jan 26, 2026Updated 4 months ago
- [Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.β130Jul 27, 2024Updated last year
- [AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Groundingβ128Dec 10, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Modelsβ40Nov 10, 2024Updated last year
- [ICML 2025] Official PyTorch implementation of LongVUβ427May 8, 2025Updated last year
- β29Apr 8, 2025Updated last year
- β66Feb 27, 2026Updated 3 months ago
- Implementation Code for paper "Efficient Multimodal Fusion via Interactive Prompting" in CVPR2023β16Jul 24, 2023Updated 2 years ago
- R1-like Video-LLM for Temporal Groundingβ136Jun 20, 2025Updated 11 months ago
- A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.β75Mar 18, 2025Updated last year