Learning to cut end-to-end pretrained modules
☆35Apr 17, 2025Updated 11 months ago
Alternatives and similar repositories for MovieCuts
Users that are interested in MovieCuts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the official repository for our ECCV 2022 paper titled, "The Anatomy of Video Editing: A Dataset and Benchmark Suite for AI-Assis…☆53Nov 28, 2022Updated 3 years ago
- [ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H…☆22Jul 26, 2025Updated 8 months ago
- Official Code of ICCV 2021 Paper: Learning to Cut by Watching Movies☆51Nov 9, 2022Updated 3 years ago
- [ECCV 2022] AutoTransition: Learning to Recommend Video Transition Effects☆65Mar 6, 2025Updated last year
- ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models☆92Sep 12, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions☆174Oct 22, 2023Updated 2 years ago
- [CVPR'23 Highlight] AutoAD: Movie Description in Context.☆103Nov 6, 2024Updated last year
- Tools for movie and video research☆306Jun 20, 2022Updated 3 years ago
- JPEG-LM: LLMs as Image Generators with Canonical Codec Representations☆14Sep 29, 2024Updated last year
- This project explores the opportunities of deep learning for camera control in virtual cinematography.☆106Jan 22, 2024Updated 2 years ago
- ☆24Mar 16, 2026Updated last week
- ☆15Oct 10, 2023Updated 2 years ago
- ☆27Mar 3, 2025Updated last year
- ☆21Aug 26, 2025Updated 7 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [CVPR 2025] VidComposition: Can MLLMs Analyze Compositions in Compiled Videos?☆29May 10, 2025Updated 10 months ago
- [BMVC 2023 (Oral)] SketchDreamer: Interactive Text-Augmented Creative Sketch Ideation☆27Jun 8, 2025Updated 9 months ago
- ☆11Nov 22, 2019Updated 6 years ago
- Official repository of "FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring"☆79Dec 5, 2025Updated 3 months ago
- This repository contains the dataset, codebase, and benchmarks for our paper: <CNVid-3.5M: Build, Filter, and Pre-train the Large-scale P…☆25Nov 28, 2023Updated 2 years ago
- [NeurIPS 2024 Spotlight] code for "Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement"☆19Jan 26, 2025Updated last year
- A dataset for Audio-Visual Sound Event Detection in Movies☆26Jan 23, 2023Updated 3 years ago
- ☆21Nov 24, 2022Updated 3 years ago
- ☆11Sep 11, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Codebase for CVPR2020 A Local-to-Global Approach to Multi-modal Movie Scene Segmentation☆234May 20, 2024Updated last year
- ☆13May 17, 2025Updated 10 months ago
- Code for the CVPR 2020 paper "Learning Instance Occlusion for Panoptic Segmentation"☆13Jun 17, 2020Updated 5 years ago
- Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation"☆106Feb 14, 2023Updated 3 years ago
- Narrative movie understanding benchmark☆76Jun 11, 2025Updated 9 months ago
- Database of cinematographic data of real films through film annotations.☆14Aug 4, 2020Updated 5 years ago
- Semi-Supervised Fine-Grained Recognition Challenge at FGVC8☆29Nov 24, 2021Updated 4 years ago
- ☆13Nov 15, 2022Updated 3 years ago
- Classification of the video file into one of the 5 classes (Static, Pan, Tilt, Zoom, Motion-Still) based on the camera/object motion in t…☆16Dec 27, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- Consistent Human Image and Video Generation with Spatially Conditioned Diffusion☆16Sep 1, 2025Updated 6 months ago
- A new multi-shot video understanding benchmark Shot2Story with comprehensive video summaries and detailed shot-level captions.☆168Jan 30, 2025Updated last year
- Official implementation of "Interpreting and Controlling Vision Foundation Models via Text Explanations"☆14May 29, 2024Updated last year
- Code for paper Audio Visual Speaker Localization from EgoCentric Views☆11Jul 3, 2024Updated last year
- ☆160Jan 16, 2025Updated last year
- CamReasoner: Reinforcing Camera Movement Understanding via Structured Spatial Reasoning☆28Feb 11, 2026Updated last month