Tool for automating common video key-frame extraction, video compression and Image Auto-crop/Image-resize tasks
☆390Aug 6, 2024Updated last year
Alternatives and similar repositories for katna
Users that are interested in katna are comparing it to the libraries listed below
Sorting:
- Experimenting with different Summarizing techniques on SumMe Dataset☆143Jul 7, 2020Updated 5 years ago
- Using Color Histogram, SVD and Dynamic Clustering Method obtained Key-Frames from a video. This analysis can be used to identify frames w…☆24Dec 14, 2020Updated 5 years ago
- ☆62Sep 2, 2024Updated last year
- Key-frame based summarization of videos☆30Dec 8, 2022Updated 3 years ago
- Python and OpenCV-based scene cut/transition detection program & library.☆4,578Feb 27, 2026Updated last week
- Video dataset and code for transforming a video's aspect ratio, from our papers "A fast smart-cropping method and dataset for video retar…☆29Jun 7, 2022Updated 3 years ago
- [CVPR 2020] 3D Photography using Context-aware Layered Depth Inpainting☆27Aug 30, 2024Updated last year
- Video Key Frame Extraction Using Local Descriptors Based on Deep Learning Method(Superpoint)☆52Aug 15, 2019Updated 6 years ago
- ☆18May 4, 2025Updated 10 months ago
- Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners☆116Sep 15, 2022Updated 3 years ago
- Deep Neural Network - Automatic selection of Thumbnails for Videos☆35Mar 1, 2018Updated 8 years ago
- Hook for persisting and rehydrating state in the React app☆10Nov 16, 2019Updated 6 years ago
- ☆11Sep 23, 2023Updated 2 years ago
- Finally, some decent sample sentences☆23Dec 3, 2023Updated 2 years ago
- ☆20Apr 24, 2024Updated last year
- TransNet V2: Shot Boundary Detection Neural Network☆880Dec 4, 2023Updated 2 years ago
- Optimal deep texture generation and style transfer based on Eric Risser's paper☆26Feb 25, 2022Updated 4 years ago
- A custom node for ComfyUI that adds cinematic and movie scene styles to video generation prompts. This node helps create more dynamic and…☆46Dec 31, 2024Updated last year
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆22Nov 1, 2025Updated 4 months ago
- Improving Symbolic Music Generation with Inference-Time Alignment☆20Aug 2, 2025Updated 7 months ago
- This is the official repository of Emotion-Driven Melody Harmonization via Melodic Variation and Functional Representation.☆12Sep 25, 2024Updated last year
- ☆12Nov 22, 2022Updated 3 years ago
- Open temporary areas onscreen☆11Nov 5, 2021Updated 4 years ago
- A large-scale evaluation benchmark called DeepFaceGen, aimed at quantitatively assessing the effectiveness of face forgery detection and …☆25May 31, 2025Updated 9 months ago
- DSNet: A Flexible Detect-to-Summarize Network for Video Summarization☆219Sep 16, 2021Updated 4 years ago
- A repository for sharing ipynb's of my experiments with ML. Some notebooks are 'old' by now and might no longer work 'out of the box'.☆77Sep 23, 2022Updated 3 years ago
- https://liuzeming01.github.io/XDailyDialog/☆13Jun 25, 2023Updated 2 years ago
- Official implementation of WildFX Dataset Generating pipeline.☆15Oct 21, 2025Updated 4 months ago
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- ☆159Jan 16, 2025Updated last year
- Lyrics and Vocal Melody Generation conditioned on Accompaniment☆29Aug 27, 2022Updated 3 years ago
- Implementation of DiffusionOverDiffusion architecture presented in NUWA-XL in a form of ControlNet-like module on top of ModelScope text2…☆86Apr 22, 2023Updated 2 years ago
- [ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the cap…☆1,492Aug 5, 2025Updated 7 months ago
- Using CogVLM and CogAgent for image captioning☆15Dec 29, 2023Updated 2 years ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆15Jun 28, 2024Updated last year
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Aug 16, 2023Updated 2 years ago
- IMAGEimate is an end-to-end pipeline to create realistic animatable 3D avatars from a single image using neural networks☆13Dec 9, 2021Updated 4 years ago
- [ICLR2026] VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling☆511Nov 18, 2025Updated 3 months ago
- [CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding☆688Jan 29, 2025Updated last year