varunmittal50 / key_frame_extraction_public
This repository is for key frame extraction process.
☆31Updated 2 years ago
Alternatives and similar repositories for key_frame_extraction_public:
Users that are interested in key_frame_extraction_public are comparing it to the libraries listed below
- This code implements a versatile image search engine leveraging the CLIP model and FAISS, capable of processing both text-to-image and i…☆40Updated last year
- Key-frame based summarization of videos☆25Updated 2 years ago
- An automated highlight generation tool for sports videos☆13Updated 2 years ago
- repo for active speaker detection for media videos.☆22Updated last year
- Automate Fashion Image Captioning using BLIP-2. Automatic generating descriptions of clothes on shopping websites, which can help custome…☆52Updated last year
- The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)☆113Updated 9 months ago
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆32Updated 2 years ago
- Wav2Lip-Emotion extends Wav2Lip to modify facial expressions of emotions via L1 reconstruction and pre-trained emotion objectives. We als…☆94Updated 2 years ago
- ☆27Updated last year
- OpenCV implementation of facial smoothing. Facial detection is done using an pretrained TensorFlow face detection model.☆50Updated 2 weeks ago
- Model for watermark classification implemented with PyTorch☆105Updated 4 months ago
- AI toolbox and pretrain models.☆37Updated 11 months ago
- One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior, CVPRW 2024☆59Updated 2 months ago
- AutoShot: A Short Video Dataset and State-of-the-Art Shot Boundary Detection - CVPR NAS 2023☆122Updated last year
- Video shot transition detection☆21Updated last year
- optimized wav2lip☆19Updated last year
- Preprocessing Scipts for Talking Face Generation☆78Updated 5 months ago
- Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023☆36Updated 4 months ago
- A deep learning model to lip-sync a given video with any given audio. It uses GAN architecture to orchestrate loss reconstruction or trai…☆113Updated last year
- Speech to Facial Animation using GANs☆41Updated 3 years ago
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released☆13Updated 3 years ago
- It is a simple python tool to extract key-frames from a video file using peak estimation from frame difference.☆150Updated last month
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder☆54Updated 5 months ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆67Updated 6 months ago
- Human body part segmentation model, trained with 22 class labels.☆15Updated last year
- ICASSP2024: Adaptive Super Resolution For One-Shot Talking-Head Generation☆174Updated 9 months ago
- An application to automatically remove selected objects from images and videos☆27Updated 3 years ago
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription.☆33Updated 10 months ago
- Multi-modal transformer approach for natural language query based joint video summarization and highlight detection☆13Updated 7 months ago