keplerlab / katna
Tool for automating common video key-frame extraction, video compression and Image Auto-crop/Image-resize tasks
☆359 Updated 10 months ago
Alternatives and similar repositories for katna
Users that are interested in katna are comparing it to the libraries listed below
- A simple Python tool that extracts key-frames from a video file using peak estimation from frame differences. ☆168 Updated 2 weeks ago
- TransNet V2: Shot Boundary Detection Neural Network ☆655 Updated last year
- Codebase for CVPR2020 A Local-to-Global Approach to Multi-modal Movie Scene Segmentation ☆229 Updated last year
- This repository contains a script to divide a video into key frames. ☆172 Updated 7 years ago
- Experimenting with different Summarizing techniques on SumMe Dataset ☆138 Updated 4 years ago
- AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection - CVPR NAS 2023 ☆165 Updated 2 years ago
- Tools for movie and video research ☆290 Updated 3 years ago
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21] ☆361 Updated 3 years ago
- [ICCV 2023] UniVTG: Towards Unified Video-Language Temporal Grounding ☆356 Updated last year
- [NeurIPS 2021] Moment-DETR code and QVHighlights dataset ☆313 Updated last year
- Code for the HowTo100M paper ☆272 Updated 5 years ago
- An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval" ☆963 Updated last year
- UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or … ☆220 Updated last year
- ☆246 Updated 2 years ago
- Video Copy Segment Localization (VCSL) dataset and benchmark [CVPR2022] ☆127 Updated last year
- Video to Text: Natural language description generator for some given video. [Video Captioning] ☆348 Updated 3 years ago
- A curated list of deep learning resources for video-text retrieval. ☆623 Updated last year
- Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020) ☆227 Updated 2 years ago
- Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20] ☆178 Updated 2 years ago
- Large-scale text-video dataset. 10 million captioned short videos. ☆643 Updated 10 months ago
- ☆188 Updated 11 months ago
- A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it ☆138 Updated last year
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning ☆289 Updated 2 years ago
- ☆133 Updated last year
- Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 … ☆232 Updated last year
- [ECCV 2022] AutoTransition: Learning to Recommend Video Transition Effects ☆63 Updated 3 months ago
- GIT: A Generative Image-to-text Transformer for Vision and Language ☆570 Updated last year
- Unsupervised video summarization with deep reinforcement learning (AAAI'18) ☆494 Updated last year
- The 1st Place Solution of the Facebook AI Image Similarity Challenge (ISC21) : Descriptor Track. ☆140 Updated last year
- A PyTorch Implementation of PGL-SUM from "Combining Global and Local Attention with Positional Encoding for Video Summarization" (IEEE IS… ☆89 Updated 2 years ago