Official code of *Towards Event-oriented Long Video Understanding*
☆12Jul 26, 2024Updated last year
Alternatives and similar repositories for Event-Bench
Users that are interested in Event-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆32Jul 29, 2024Updated last year
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs☆55Mar 9, 2025Updated last year
- Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows☆19Nov 4, 2025Updated 4 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆43Feb 27, 2025Updated last year
- [NeurIPS 2025] Scaling Language-centric Omnimodal Representation Learning☆36Feb 6, 2026Updated last month
- Official code for "What Makes for Good Visual Tokenizers for Large Language Models?".☆59Jun 27, 2023Updated 2 years ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆109May 27, 2025Updated 9 months ago
- ☆138Sep 29, 2024Updated last year
- textteaser中文版☆11Jun 2, 2018Updated 7 years ago
- The official GitHub page for ''What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Ins…☆19Nov 10, 2023Updated 2 years ago
- This is about the English test - cloze test which use the Google BERT model to predict the probable word.☆11May 18, 2021Updated 4 years ago
- This is the implementation of paper "Learning to Ask Conversational Questions by Optimizing Levenshtein Distance".☆10Jul 5, 2021Updated 4 years ago
- Temporal Compact Bilinear Pooling (TCBP)☆11May 27, 2020Updated 5 years ago
- ☆10May 24, 2023Updated 2 years ago
- Codes relevant to ontology alignment☆14Aug 25, 2021Updated 4 years ago
- official implementation of 'STREAM : Spatio-TempoRal Evaluation and Analysis Metric for Video Generative Models'☆28Dec 24, 2025Updated 2 months ago
- A lightweight flexible Video-MLLM developed by TencentQQ Multimedia Research Team.☆74Oct 14, 2024Updated last year
- Code and Data for "SCTc-TE: A Comprehensive Formulation and Benchmark for Temporal Event Forecasting""☆16Feb 2, 2024Updated 2 years ago
- Offical implemention of Robust Superpixel-Guided Attentional Adversarial Attack (CVPR2020)☆10Jan 5, 2022Updated 4 years ago
- (EMNLP 2025 Main) RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives☆37Dec 20, 2025Updated 3 months ago
- VideoHallucer, The first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)☆42Dec 16, 2025Updated 3 months ago
- ☆157Oct 31, 2024Updated last year
- ☆19Jun 10, 2025Updated 9 months ago
- Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurI…☆94Apr 29, 2024Updated last year
- ✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio☆52Jul 11, 2025Updated 8 months ago
- Code for our ACL 2025 paper "Language Repository for Long Video Understanding"☆36Jun 17, 2024Updated last year
- ☆11May 24, 2024Updated last year
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆20May 27, 2025Updated 9 months ago
- Long Context Transfer from Language to Vision☆402Mar 18, 2025Updated last year
- Multi-task learning of Abstractive Summarization with Entailment Generation implemented using PyTorch☆16Jun 11, 2018Updated 7 years ago
- The ASS subtitle special effects generating tool, super awesome effects!!! Made by MeteorX!!!☆12Dec 4, 2021Updated 4 years ago
- ☆74May 10, 2024Updated last year
- LaTeX 排版学术论文编写规则(国家标准GB/T 7713.2—2022)☆24Mar 25, 2024Updated last year
- [ICCV 2025 Highlight] The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"☆196Mar 17, 2025Updated last year
- [ACL 2023] Contextual Distortion Reveals Constituency: Mask Language Models are Implicit Parsers.☆14Jun 3, 2023Updated 2 years ago
- Densely Captioned Images (DCI) dataset repository.☆198Jul 1, 2024Updated last year
- Python implementation for paper: Feature Distillation: DNN-Oriented JPEG Compression Against Adversarial Examples☆11Jun 12, 2018Updated 7 years ago
- ☆36Mar 18, 2025Updated last year
- Official PyTorch implementation of RACRO (https://www.arxiv.org/abs/2506.04559)☆19Jul 1, 2025Updated 8 months ago