πΎ E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding (NeurIPS 2024)
β74Jan 20, 2025Updated last year
Alternatives and similar repositories for ETBench
Users that are interested in ETBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequencesβ45Mar 11, 2025Updated last year
- [CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selectionβ140Jul 28, 2025Updated 11 months ago
- β32Jul 29, 2024Updated last year
- [NeurIPS 2024] Mitigating Object Hallucination via Concentric Causal Attentionβ66Aug 30, 2025Updated 9 months ago
- β81Nov 24, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [CVPR 2025] LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understandingβ84Jul 4, 2025Updated 11 months ago
- [CVPR 2026] TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMsβ146Apr 27, 2026Updated 2 months ago
- π§ VideoMind: A Chain-of-LoRA Agent for Temporal-Grounded Video Reasoning (ICLR 2026)β343Feb 8, 2026Updated 4 months ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.β21Jul 10, 2025Updated 11 months ago
- Data release for Step Differences in Instructional Video (CVPR24)β15Jun 19, 2024Updated 2 years ago
- [CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".β296Jun 13, 2024Updated 2 years ago
- TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of uβ¦β27Jun 4, 2025Updated last year
- β13Apr 13, 2026Updated 2 months ago
- [NeurIPS 2023] Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentationβ20Jan 3, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- β18Jul 10, 2024Updated last year
- UniMD: Towards Unifying Moment retrieval and temporal action Detectionβ57Jul 5, 2024Updated last year
- β31Nov 17, 2024Updated last year
- A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Abilityβ107Nov 28, 2024Updated last year
- [ACL 2024 Findings] "TempCompass: Do Video LLMs Really Understand Videos?", Yuanxin Liu, Shicheng Li, Yi Liu, Yuxiang Wang, Shuhuai Ren, β¦β132Apr 4, 2025Updated last year
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentationβ36Feb 28, 2026Updated 4 months ago
- Official implementation of paper VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interactβ¦β43Feb 5, 2025Updated last year
- [ICCV 2025] Dynamic-VLMβ28Dec 16, 2024Updated last year
- The official code of Towards Balanced Alignment: Modal-Enhanced Semantic Modeling for Video Moment Retrieval (AAAI2024)β32Mar 29, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [EMNLP 2025 Findings] Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Modelsβ148Aug 21, 2025Updated 10 months ago
- [EMNLP 2025 Main] The official repo of MMLU-ProX benchmark.β29Aug 26, 2025Updated 10 months ago
- β14Oct 30, 2023Updated 2 years ago
- Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"β95Mar 9, 2025Updated last year
- Official PyTorch code of GroundVQA (CVPR'24)β63Sep 13, 2024Updated last year
- Official Repository of paper VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understandingβ292Aug 5, 2025Updated 10 months ago
- β18Jan 26, 2026Updated 5 months ago
- [Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.β131Jul 27, 2024Updated last year
- [AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Groundingβ129Dec 10, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Modelsβ40Nov 10, 2024Updated last year
- [ICML 2025] Official PyTorch implementation of LongVUβ427May 8, 2025Updated last year
- β29Apr 8, 2025Updated last year
- Implementation Code for paper "Efficient Multimodal Fusion via Interactive Prompting" in CVPR2023β16Jul 24, 2023Updated 2 years ago
- R1-like Video-LLM for Temporal Groundingβ137Jun 20, 2025Updated last year
- A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.β75Mar 18, 2025Updated last year
- β10Jul 5, 2024Updated last year