keplerlab / katna
Tool for automating common video key-frame extraction, video compression and Image Auto-crop/Image-resize tasks
☆352 · Updated 8 months ago
Alternatives and similar repositories for katna:
Users who are interested in katna are comparing it to the libraries listed below:
- A simple Python tool that extracts key-frames from a video file using peak estimation from frame differences. ☆154 · Updated 5 months ago
- Codebase for CVPR2020 A Local-to-Global Approach to Multi-modal Movie Scene Segmentation ☆228 · Updated 11 months ago
- Tools for movie and video research ☆288 · Updated 2 years ago
- Large-scale text-video dataset. 10 million captioned short videos. ☆629 · Updated 8 months ago
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21] ☆360 · Updated 2 years ago
- [NeurIPS 2021] Moment-DETR code and QVHighlights dataset ☆303 · Updated last year
- Story-Based Retrieval with Contextual Embeddings. Largest freely available movie video dataset. [ACCV'20] ☆177 · Updated 2 years ago
- ☆129 · Updated last year
- AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection - CVPR NAS 2023 ☆147 · Updated 2 years ago
- A script that divides a video into key frames. ☆169 · Updated 7 years ago
- ☆244 · Updated 2 years ago
- Key-frame based summarization of videos ☆27 · Updated 2 years ago
- Experimenting with different Summarizing techniques on SumMe Dataset ☆138 · Updated 4 years ago
- Source code for "Bi-modal Transformer for Dense Video Captioning" (BMVC 2020) ☆227 · Updated 2 years ago
- TransNet V2: Shot Boundary Detection Neural Network ☆601 · Updated last year
- Search photos on Unsplash based on OpenAI's CLIP model, support search with joint image+text queries and attention visualization. ☆221 · Updated 3 years ago
- Obtains key-frames from a video using color histograms, SVD, and dynamic clustering. This analysis can be used to identify frames w… ☆22 · Updated 4 years ago
- ☆76 · Updated 2 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning ☆288 · Updated 2 years ago
- Code for the HowTo100M paper ☆267 · Updated 5 years ago
- Video to Text: Natural language description generator for some given video. [Video Captioning] ☆343 · Updated 2 years ago
- Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and T… ☆591 · Updated 2 months ago
- A curated list of deep learning resources for video-text retrieval. ☆617 · Updated last year
- Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral). ☆138 · Updated 2 years ago
- UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or … ☆212 · Updated last year
- Video embeddings for retrieval with natural language queries ☆341 · Updated 2 years ago
- Align and Prompt: Video-and-Language Pre-training with Entity Prompts ☆187 · Updated 2 years ago
- Easily compute clip embeddings from video frames ☆144 · Updated last year
- An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval" ☆156 · Updated last year
- An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval" ☆935 · Updated last year