Nota-NetsPresso / nota-wav2lip
A 28× Compressed Wav2Lip for Efficient Talking Face Generation [ICCV'23 Demo] [MLSys'23 Workshop] [NVIDIA GTC'23]
☆56Updated last year
Alternatives and similar repositories for nota-wav2lip:
Users that are interested in nota-wav2lip are comparing it to the libraries listed below
- A library for training, compressing and deploying computer vision models (including ViT) with edge devices☆68Updated 2 weeks ago
- The official NetsPresso Python package.☆44Updated this week
- A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ECCV'24]☆285Updated 8 months ago
- ☆32Updated 2 years ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆16Updated 8 months ago
- [Interspeech 2024] SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization☆48Updated last week
- [ICIAP 2023] Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation☆62Updated last year
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory☆33Updated 2 years ago
- Talking head animation☆27Updated last year
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released☆13Updated 3 years ago
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder☆56Updated 8 months ago
- Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]☆76Updated 6 months ago
- Official Implementation of LatentSwap:An Efficient Latent Code Mapping Framework for Face Swapping☆12Updated last week
- PersonaTalk Hack☆13Updated 2 months ago
- Talking Face Generation system☆19Updated last year
- ☆13Updated 10 months ago
- [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization☆48Updated 3 months ago
- Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023☆85Updated last year
- ☆54Updated last year
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription.☆33Updated 3 weeks ago
- [ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video☆68Updated last year
- Unoffical LivePortrait Training Script [ 🚧 Under Construction]☆27Updated 2 months ago
- ☆22Updated last year
- Talking Head from Speech Audio using a Pre-trained Image Generator☆23Updated 10 months ago
- Official Access to ICIP2024 "THQA: A Perceptual Quality Assessment Database for Talking Heads"☆31Updated 2 months ago
- End-To-End SpeechSynthesis system with knowledge distillation☆16Updated 2 years ago
- [AAAI 2024] stle2talker - Official PyTorch Implementation☆38Updated last year
- 4G GPU & 10 Minutes for train☆12Updated last year
- Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS☆37Updated last year
- ☆35Updated 11 months ago