Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]
☆194 · Sep 21, 2022 · Updated 3 years ago
Alternatives and similar repositories for CondensedMovies
Users who are interested in CondensedMovies compare it to the repositories listed below.
- Condensed Movies Challenge 2021 ☆20 · Sep 21, 2022 · Updated 3 years ago
- Tools for movie and video research ☆305 · Jun 20, 2022 · Updated 3 years ago
- [CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990) ☆61 · Aug 17, 2021 · Updated 4 years ago
- [CVPR'23 Highlight] AutoAD: Movie Description in Context. ☆103 · Nov 6, 2024 · Updated last year
- This repository contains the codebase for MovieCLIP: Visual Scene Recognition in Movies ☆42 · Oct 1, 2023 · Updated 2 years ago
- Narrative movie understanding benchmark ☆76 · Jun 11, 2025 · Updated 8 months ago
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21] ☆380 · May 19, 2022 · Updated 3 years ago
- MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions ☆172 · Oct 22, 2023 · Updated 2 years ago
- A video retrieval dataset How2R and a video QA dataset How2QA ☆24 · Oct 15, 2020 · Updated 5 years ago
- PyTorch GPU distributed training code for MIL-NCE HowTo100M ☆219 · Jul 5, 2022 · Updated 3 years ago
- Codebase for CVPR2020 A Local-to-Global Approach to Multi-modal Movie Scene Segmentation ☆234 · May 20, 2024 · Updated last year
- ☆22 · Feb 25, 2021 · Updated 5 years ago
- ☆139 · Jan 3, 2024 · Updated 2 years ago
- Code for "A Graph-Based Framework to Bridge Movies and Synopses", ICCV2019 ☆52 · Aug 9, 2020 · Updated 5 years ago
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions ☆10 · Jan 17, 2021 · Updated 5 years ago
- An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval" ☆1,025 · Apr 12, 2024 · Updated last year
- Code for the HowTo100M paper ☆294 · Mar 10, 2020 · Updated 5 years ago
- Easily compute clip embeddings from video frames ☆145 · Oct 31, 2023 · Updated 2 years ago
- ☆21 · Aug 26, 2025 · Updated 6 months ago
- Code for Learning to Learn Language from Narrated Video ☆33 · Oct 3, 2023 · Updated 2 years ago
- [T-PAMI 2023] Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection ☆38 · Aug 29, 2023 · Updated 2 years ago
- ☆87 · Mar 4, 2024 · Updated 2 years ago
- Video embeddings for retrieval with natural language queries ☆342 · Feb 15, 2023 · Updated 3 years ago
- Video datasets ☆1,618 · Mar 8, 2023 · Updated 3 years ago
- [CVPR2023] All in One: Exploring Unified Video-Language Pre-training ☆281 · Mar 25, 2023 · Updated 2 years ago
- Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation" ☆105 · Feb 14, 2023 · Updated 3 years ago
- Shapley values for assessing the importance of each frame in a video ☆17 · Mar 1, 2021 · Updated 5 years ago
- Referring expression comprehension on ReferIt(RefClef) ☆10 · Nov 28, 2016 · Updated 9 years ago
- Data and code for CVPR 2020 paper: "VIOLIN: A Large-Scale Dataset for Video-and-Language Inference" ☆161 · Apr 29, 2020 · Updated 5 years ago
- A PyTorch implementation of VIOLET ☆140 · Dec 17, 2023 · Updated 2 years ago
- The HC-STVG Dataset ☆62 · Apr 12, 2023 · Updated 2 years ago
- Large-scale text-video dataset. 10 million captioned short videos. ☆677 · Aug 14, 2024 · Updated last year
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning… ☆729 · Aug 8, 2023 · Updated 2 years ago
- ☆16 · Dec 25, 2021 · Updated 4 years ago
- Pytorch version of DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization (NAACL 2021) ☆17 · Jan 12, 2023 · Updated 3 years ago
- This repository provides the dataset introduced by the paper "Where Does It Exist: Spatio-Temporal Video Grounding for Multi-Form Sentenc… ☆69 · May 1, 2020 · Updated 5 years ago
- Experiments with multimodal deep learning models based on transformers ☆11 · Oct 9, 2022 · Updated 3 years ago
- ☆71 · Oct 6, 2023 · Updated 2 years ago
- Multi-Modal Transformer for Video Retrieval ☆265 · Oct 9, 2024 · Updated last year