varunmittal50 / key_frame_extraction_publicLinks
This repository is for key frame extraction process.
☆32Updated 3 years ago
Alternatives and similar repositories for key_frame_extraction_public
Users that are interested in key_frame_extraction_public are comparing it to the libraries listed below
Sorting:
- repo for active speaker detection for media videos.☆30Updated 2 years ago
- an optimized, production-ready implementation of active speaker detection☆73Updated last year
- A deep learning model to lip-sync a given video with any given audio. It uses GAN architecture to orchestrate loss reconstruction or trai…☆122Updated 2 years ago
- Code for One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning (AAAI 2022)☆358Updated 2 years ago
- [CVPR] MARLIN: Masked Autoencoder for facial video Representation LearnINg☆259Updated 7 months ago
- optimized wav2lip☆18Updated last year
- Faster Talking Face Animation on Xeon CPU☆129Updated 2 years ago
- lipsync is a simple and updated Python library for lip synchronization, based on Wav2Lip. It synchronizes lips in videos and images based…☆138Updated 10 months ago
- A curated list of resources of audio-driven talking face generation☆143Updated 3 years ago
- Efficient approach to speaker diarization using voice characteristics extraction☆104Updated 5 months ago
- Avatar Generation For Characters and Game Assets Using Deep Fakes☆231Updated last year
- Pytorch official implementation for our paper "HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation".☆212Updated last year
- PyTorch Implementation for Paper "Emotionally Enhanced Talking Face Generation" (ICCVW'23 and ACM-MMW'23)☆375Updated 10 months ago
- openai/whisper + extra features☆89Updated 3 years ago
- The code for some apps built with Sieve.☆83Updated 11 months ago
- A curated list of 'Talking Head Generation' resources. Features influential papers, groundbreaking algorithms, crucial GitHub repositorie…☆76Updated 2 years ago
- [NeurIPS 2024] This is the official repo of the paper "Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Li…☆129Updated 9 months ago
- ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'☆427Updated 2 years ago
- ICASSP2024: Adaptive Super Resolution For One-Shot Talking-Head Generation☆180Updated last year
- Speech to Facial Animation using GANs☆40Updated 4 years ago
- Wav2Lip-Emotion extends Wav2Lip to modify facial expressions of emotions via L1 reconstruction and pre-trained emotion objectives. We als…☆96Updated 3 years ago
- Alternative to Flawless AI's TrueSync. Make lips in video match provided audio using the power of Wav2Lip and GFPGAN.☆127Updated last year
- Our idea is to combine the power of computer vision model and LLMs. We use YOLO, CLIP and DINOv2 to extract high-level features from imag…☆117Updated 2 years ago
- ☆15Updated last year
- 👗 DM-VTON: Distilled Mobile Real-time Virtual Try-On☆137Updated last year
- AI Talking Head: create video from plain text or audio file in minutes, support up to 100+ languages and 350+ voice models.☆36Updated 3 years ago
- Age Estimation with PyTorch: Deep Learning for Predicting Age☆71Updated last year
- One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior, CVPRW 2024☆65Updated last year
- The code for the paper "Speech Driven Talking Face Generation from a Single Image and an Emotion Condition"☆172Updated 2 years ago
- Audio-Visual Generative Adversarial Network for Face Reenactment☆157Updated 2 months ago