usc-sail / mica-MovieCLIP
This repository contains the codebase for MovieCLIP: Visual Scene Recognition in Movies
☆37Updated last year
Alternatives and similar repositories for mica-MovieCLIP:
Users that are interested in mica-MovieCLIP are comparing it to the libraries listed below
- Condensed Movies Challenge 2021☆19Updated 2 years ago
- Official Code of ICCV 2021 Paper: Learning to Cut by Watching Movies☆51Updated 2 years ago
- [CVPR'23 Highlight] AutoAD: Movie Description in Context.☆94Updated 4 months ago
- Using pretrained encoder and language models to generate captions from multimedia inputs.☆94Updated 2 years ago
- Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]☆175Updated 2 years ago
- Aggregating embeddings over time☆31Updated 2 years ago
- Learning to cut end-to-end pretrained modules☆30Updated 8 months ago
- Video shot transition detection☆21Updated 2 years ago
- code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection☆25Updated last year
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆25Updated last year
- A dataset with classified film shots☆12Updated 2 years ago
- Graph learning framework for long-term video understanding☆59Updated last month
- multimodal video-audio-text generation and retrieval between every pair of modalities on the MUGEN dataset. The repo. contains the traini…☆39Updated last year
- Long-Term Rhythmic Video Soundtracker, ICML2023☆57Updated 8 months ago
- EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties☆121Updated 4 months ago
- ☆76Updated 2 years ago
- ☆15Updated 2 years ago
- Script and models for clustering LAION-400m CLIP embeddings.☆25Updated 3 years ago
- repo for active speaker detection for media videos.☆26Updated last year
- Easily compute clip embeddings from video frames☆143Updated last year
- This is the official repository for our ECCV 2022 paper titled, "The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assis…☆49Updated 2 years ago
- ☆21Updated 4 years ago
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆80Updated 9 months ago
- ☆20Updated 10 months ago
- Anim-400K: A dataset designed from the ground up for automated dubbing of video☆105Updated 9 months ago
- A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it☆134Updated last year
- [WACV 2025] - EmoVOCA: Speech-Driven Emotional 3D Talking Heads☆18Updated last month
- A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or v…☆36Updated last year
- Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022)☆65Updated last year
- Extracted YouTube 8M URLs and Labels without all the TF Record parsing/features☆24Updated last year