E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding (NeurIPS 2024)
☆74 · Jan 20, 2025 · Updated last year
Alternatives and similar repositories for ETBench
Users that are interested in ETBench are comparing it to the libraries listed below.
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences ☆44 · Mar 11, 2025 · Updated last year
- [CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection ☆141 · Jul 28, 2025 · Updated 9 months ago
- ☆32 · Jul 29, 2024 · Updated last year
- [NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attention ☆65 · Aug 30, 2025 · Updated 8 months ago
- ☆80 · Nov 24, 2024 · Updated last year
- [CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs ☆129 · Updated this week
- [CVPR 2025] LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding ☆84 · Jul 4, 2025 · Updated 9 months ago
- VideoMind: A Chain-of-LoRA Agent for Temporal-Grounded Video Reasoning (ICLR 2026) ☆324 · Feb 8, 2026 · Updated 2 months ago
- DisTime: Distribution-based Time Representation for Video Large Language Models. ☆20 · Jul 10, 2025 · Updated 9 months ago
- Data release for Step Differences in Instructional Video (CVPR24) ☆14 · Jun 19, 2024 · Updated last year
- [CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments". ☆296 · Jun 13, 2024 · Updated last year
- TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u… ☆26 · Jun 4, 2025 · Updated 10 months ago
- ☆13 · Apr 13, 2026 · Updated 2 weeks ago
- [NeurIPS 2023] Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation ☆20 · Jan 3, 2024 · Updated 2 years ago
- ☆18 · Jul 10, 2024 · Updated last year
- ☆31 · Nov 17, 2024 · Updated last year
- A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability ☆106 · Nov 28, 2024 · Updated last year
- [ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, … ☆131 · Apr 4, 2025 · Updated last year
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation ☆35 · Feb 28, 2026 · Updated 2 months ago
- Official implementation of paper VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interact… ☆44 · Feb 5, 2025 · Updated last year
- [ICCV 2025] Dynamic-VLM ☆28 · Dec 16, 2024 · Updated last year
- The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024) ☆32 · Mar 29, 2024 · Updated 2 years ago
- [EMNLP 2025 Findings] Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models ☆143 · Aug 21, 2025 · Updated 8 months ago
- Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs" ☆92 · Mar 9, 2025 · Updated last year
- ☆14 · Oct 30, 2023 · Updated 2 years ago
- [EMNLP 2025 Main] The official repo of MMLU-ProX benchmark. ☆28 · Aug 26, 2025 · Updated 8 months ago
- Official PyTorch code of GroundVQA (CVPR'24) ☆64 · Sep 13, 2024 · Updated last year
- Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding ☆291 · Aug 5, 2025 · Updated 8 months ago
- ☆19 · Jan 26, 2026 · Updated 3 months ago
- [Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench. ☆122 · Jul 27, 2024 · Updated last year
- [AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding ☆127 · Dec 10, 2024 · Updated last year
- ☆63 · Feb 27, 2026 · Updated 2 months ago
- TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models ☆40 · Nov 10, 2024 · Updated last year
- [ICML 2025] Official PyTorch implementation of LongVU ☆425 · May 8, 2025 · Updated 11 months ago
- ☆29 · Apr 8, 2025 · Updated last year
- Implementation Code for paper "Efficient Multimodal Fusion via Interactive Prompting" in CVPR2023 ☆17 · Jul 24, 2023 · Updated 2 years ago
- R1-like Video-LLM for Temporal Grounding ☆135 · Jun 20, 2025 · Updated 10 months ago
- A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models. ☆76 · Mar 18, 2025 · Updated last year
- ☆10 · Jul 5, 2024 · Updated last year