PardoAlejo / LearningToCut
Official Code of ICCV 2021 Paper: Learning to Cut by Watching Movies
☆51Updated 2 years ago
Alternatives and similar repositories for LearningToCut:
Users that are interested in LearningToCut are comparing it to the libraries listed below
- Learning to cut end-to-end pretrained modules☆28Updated 6 months ago
- This is the official repository for our ECCV 2022 paper titled, "The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assis…☆48Updated 2 years ago
- Condensed Movies Challenge 2021☆17Updated 2 years ago
- VideoCC is a dataset containing (video-URL, caption) pairs for training video-text machine learning models. It is created using an automa…☆76Updated 2 years ago
- ☆105Updated 2 years ago
- MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions☆156Updated last year
- Implementation of Cross-category Video Highlight Detection via Set-based Learning (ICCV 2021).☆72Updated 3 years ago
- Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20]☆167Updated 2 years ago
- (wip) Use LAION-AI's CLIP "conditoned prior" to generate CLIP image embeds from CLIP text embeds.☆27Updated 2 years ago
- ☆120Updated last year
- ☆31Updated last year
- Extracted YouTube 8M URLs and Labels without all the TF Record parsing/features☆23Updated last year
- [CVPR'23 Highlight] AutoAD: Movie Description in Context.☆91Updated 2 months ago
- ☆31Updated 3 years ago
- Use CLIP to represent video for Retrieval Task☆69Updated 3 years ago
- EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties☆119Updated 2 months ago
- ☆72Updated 8 months ago
- Python3 Implementation for 'Visual Rhythm and Beat' SIGGRAPH 2018☆18Updated 2 years ago
- The 1st place solution of 2022 Ego4d Natural Language Queries.☆32Updated 2 years ago
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆24Updated last year
- [CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.☆114Updated last year
- ☆50Updated 2 years ago
- ☆98Updated 2 months ago
- Learning Representational Invariances for Data-Efficient Action Recognition☆33Updated 3 years ago
- ☆54Updated 2 years ago
- Video shot transition detection☆21Updated last year
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆78Updated 9 months ago
- Code for CVPR2023 paper "Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies"☆17Updated last year
- [Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training☆21Updated 2 years ago
- Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)☆52Updated 11 months ago