TransNet V2: Shot Boundary Detection Neural Network
☆873Dec 4, 2023Updated 2 years ago
Alternatives and similar repositories for TransNetV2
Users that are interested in TransNetV2 are comparing it to the libraries listed below
Sorting:
- AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection - CVPR NAS 2023☆214Apr 18, 2023Updated 2 years ago
- Python and OpenCV-based scene cut/transition detection program & library.☆4,578Updated this week
- TransNet: A deep network for fast detection of common shot transitions☆61Jun 8, 2020Updated 5 years ago
- Code for CVPR 2022 paper "Scene Consistency Representation Learning for Video Scene Segmentation"☆105Feb 14, 2023Updated 3 years ago
- Codebase for CVPR2020 A Local-to-Global Approach to Multi-modal Movie Scene Segmentation☆234May 20, 2024Updated last year
- ClipShots is the first large-scale dataset for shot boundary detection collected from Youtube and Weibo covering more than 20 categories,…☆124Nov 9, 2021Updated 4 years ago
- Tools for movie and video research☆304Jun 20, 2022Updated 3 years ago
- This is our implementation of deepSBD for ClipShots dataset.☆70Jun 11, 2020Updated 5 years ago
- Implementation of the paper 'Ridiculously Fast Shot Boundary Detection with Fully Convolutional Neural Networks' from scratch.☆62Sep 2, 2019Updated 6 years ago
- ☆34Jun 2, 2023Updated 2 years ago
- ☆138Jan 3, 2024Updated 2 years ago
- Large-scale, Fast and Accurate Shot Boundary Detection through Spatio-temporal Convolutional Neural Networks☆70Oct 9, 2020Updated 5 years ago
- ☆62Sep 2, 2024Updated last year
- A simple GUI to show shot boundary detection based on TransNet V2.☆28Dec 5, 2020Updated 5 years ago
- A comparison of ffmpeg, Shotdetect and PySceneDetect for shot transition detection☆125May 5, 2018Updated 7 years ago
- Video shot transition detection☆25Mar 9, 2023Updated 2 years ago
- [ECCV2024] Video Foundation Models & Data for Multimodal Understanding☆2,201Dec 15, 2025Updated 2 months ago
- [ICCV 2023, Official Code] for paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspect…☆487Aug 12, 2024Updated last year
- ☆11Sep 30, 2021Updated 4 years ago
- An efficient video loader for deep learning with smart shuffling that's super easy to digest☆2,427Jul 17, 2024Updated last year
- 📹 A more flexible framework that can generate videos at any resolution and creates videos from images.☆1,929Updated this week
- [CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers☆674Oct 25, 2024Updated last year
- ☆15Aug 3, 2019Updated 6 years ago
- An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"☆1,025Apr 12, 2024Updated last year
- Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"☆506Sep 2, 2024Updated last year
- Shot boundary detection (SBD) python program: Internship (Summer 2016) project☆37Oct 28, 2016Updated 9 years ago
- Multi-modality pre-training☆510May 8, 2024Updated last year
- CLIP+MLP Aesthetic Score Predictor☆1,261Jul 1, 2024Updated last year
- Condensed Movies Challenge 2021☆20Sep 21, 2022Updated 3 years ago
- [ECCV 2022] AutoTransition: Learning to Recommend Video Transition Effects☆65Mar 6, 2025Updated 11 months ago
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆6,471Jun 28, 2024Updated last year
- [TMLR 2025] Latte: Latent Diffusion Transformer for Video Generation.☆1,920Oct 30, 2025Updated 4 months ago
- Implementation of Cross-category Video Highlight Detection via Set-based Learning (ICCV 2021).☆79Aug 27, 2021Updated 4 years ago
- Official implementation of AnimateDiff.☆12,038Jul 31, 2024Updated last year
- LAVIS - A One-stop Library for Language-Vision Intelligence☆11,167Nov 18, 2024Updated last year
- [CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding☆686Jan 29, 2025Updated last year
- text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆12,449Nov 4, 2025Updated 3 months ago
- [NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training☆1,683Dec 8, 2023Updated 2 years ago
- A curated list of recent diffusion models for video generation, editing, and various other applications.☆5,466Feb 23, 2026Updated last week