tychen-SJTU/MECD-Benchmark

tychen-SJTU / MECD-Benchmark

[NeurIPS'24 spotlight] MECD: Unlocking Multi-Event Causal Discovery in Video Reasoning. [TPAMI'25] MECD+

☆50

Alternatives and similar repositories for MECD-Benchmark

Users who are interested in MECD-Benchmark are comparing it to the libraries listed below.

WissingChen / CRA-GQA
The official implementation of "Cross-modal Causal Relation Alignment for Video Question Grounding. (CVPR 2025 Highlight)"
☆52 · Apr 27, 2025 · Updated last year
yl3800 / EIGV
☆15 · Aug 12, 2022 · Updated 3 years ago
Andy-Cheng / TEMPURA
TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u…
☆27 · Jun 4, 2025 · Updated last year
hshjerry / VideoEspresso
[CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection
☆140 · Jul 28, 2025 · Updated last year
MAC-AutoML / WFS-SB
[CVPR 2026] Wavelet-based Frame Selection by Detecting Semantic Boundary for Long Video Understanding
☆32 · Apr 12, 2026 · Updated 3 months ago
csbobby / STAR_Benchmark
☆36 · Apr 18, 2024 · Updated 2 years ago
renjie-liang / HUAL
Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning
☆15 · Dec 12, 2023 · Updated 2 years ago
scofield7419 / Video-of-Thought
Video Chain of Thought, Codes for ICML 2024 paper: "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition"
☆182 · Feb 25, 2025 · Updated last year
Letian2003 / C-VQA
Counterfactual Reasoning VQA Dataset
☆28 · Nov 23, 2023 · Updated 2 years ago
mengcaopku / SpatialDreamer
SpatialDreamer: Incentivizing Spatial Reasoning via Active Mental Imagery
☆15 · Feb 1, 2026 · Updated 5 months ago
facebookresearch / CausalVQA
We introduce CausalVQA, a benchmark dataset for video question answering (VQA) composed of question-answer pairs that probe models’ under…
☆62 · Aug 18, 2025 · Updated 11 months ago
sming256 / BOLT
[CVPR2025] BOLT: Boost Large Vision-Language Model Without Training for Long-form Video Understanding
☆55 · Feb 5, 2026 · Updated 5 months ago
Lkydong2020 / SEAL_WTAL
[AAAI-25] Code for SEAL
☆15 · Sep 25, 2025 · Updated 10 months ago
OuyangKun10 / Conan
Multi-step reasoning MLLM
☆25 · Mar 8, 2026 · Updated 4 months ago
svip-lab / SVIP-Sequence-VerIfication-for-Procedures-in-Videos
[CVPR2022] SVIP: Sequence VerIfication for Procedures in Videos
☆24 · Feb 24, 2023 · Updated 3 years ago
DCDmllm / Momentor
☆81 · Nov 24, 2024 · Updated last year
mlvlab / VidChain
Official Implementation (Pytorch) of the "VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Capti…
☆25 · Jan 26, 2025 · Updated last year
HuiGuanLab / RaTSG
This repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"
☆13 · Aug 22, 2025 · Updated 11 months ago
sejong-rcv / PVLR
[ACM MM-24] Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization
☆13 · Oct 8, 2024 · Updated last year
seyoungahn / FedDif
Official implementations for "Communication-Efficient Diffusion Strategy for Performance Improvement of Federated Learning with Non-IID D…
☆22 · Mar 14, 2024 · Updated 2 years ago
doc-doc / NExT-QA
NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)
☆189 · Aug 2, 2025 · Updated 11 months ago
aurooj / SHG-VQA
Learning Situation Hyper-Graphs for Video Question Answering
☆23 · Feb 16, 2024 · Updated 2 years ago
ZijiaLewisLu / CVPR2025-DeCafNet
Official Repo for CVPR 2025 Paper -- DeCafNet: Delegate and Conquer for Efficient Temporal Grounding in Long Videos
☆17 · Mar 16, 2026 · Updated 4 months ago
Fdioa / PruneRAG
☆30 · Jun 26, 2026 · Updated last month
hucvl / craft
CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions
☆16 · Jun 10, 2021 · Updated 5 years ago
djsutherland / igms
Implicit generative models and related stuff based on the MMD, in PyTorch
☆16 · Sep 24, 2020 · Updated 5 years ago
Macielyoung / FinCUGE_Instruction
FinCUGE Instruction dataset
☆16 · Apr 29, 2023 · Updated 3 years ago
matthklein / fair_k_center_clustering
Code for our paper "Fair k-Center Clustering for Data Summarization"
☆12 · Apr 26, 2019 · Updated 7 years ago
TencentARC / GRPO-CARE
[ACL2026 Findings] GRPO-CARE: Consistency-Aware Reinforcement Learning for Multimodal Reasoning
☆83 · Jun 23, 2025 · Updated last year
CausalVerse / CausalVerseBenchmark
CausalVerse is a comprehensive benchmark for Causal Representation Learning (CRL) focused on recovering the data-generating process.
☆16 · Apr 10, 2026 · Updated 3 months ago
xxxiaol / counterfactual-recipe-generation
Source code and data for Counterfactual Recipe Generation: Exploring Models’ Compositional Generalization Ability in a Realistic Scenario…
☆15 · Oct 25, 2022 · Updated 3 years ago
Tanveer81 / ReVisionLLM
This is the official implementation of ReVisionLLM: Recursive Vision-Language Model for Temporal Grounding in Hour-Long Videos
☆47 · Nov 5, 2025 · Updated 8 months ago
VinAIResearch / HyperCUT
HyperCUT: Video Sequence from a Single Blurry Image using Unsupervised Ordering (CVPR'23)
☆14 · Nov 4, 2025 · Updated 8 months ago
zhenjia-xu / DensePhysNet-Simulation
☆31 · Oct 3, 2023 · Updated 2 years ago
traveler-framework / TraveLER
[EMNLP 2024] TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering
☆18 · Oct 31, 2024 · Updated last year
chakravarthi589 / Video-Question-Answering_Resources
Video Question Answering | Video QA | VQA
☆97 · Jun 12, 2026 · Updated last month
ictnlp / FastLongSpeech
FastLongSpeech is a novel framework designed to extend the capabilities of Large Speech-Language Models for efficient long-speech process…
☆16 · Jul 22, 2025 · Updated last year
liuting20 / MustDrop
Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model
☆36 · Jan 8, 2025 · Updated last year
microsoft / FIVE-UI-Evol
☆31 · Apr 15, 2026 · Updated 3 months ago