sectum1919 / cncvs_data_collectorLinks
☆26Updated 2 years ago
Alternatives and similar repositories for cncvs_data_collector
Users that are interested in cncvs_data_collector are comparing it to the libraries listed below
Sorting:
- ☆14Updated 9 months ago
- Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices☆68Updated last year
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB ima…☆20Updated last year
- Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023☆86Updated 2 years ago
- [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization☆52Updated 11 months ago
- Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024).☆25Updated last year
- PyTorch implementation of "Lip to Speech Synthesis in the Wild with Multi-task Learning" (ICASSP2023)☆70Updated last year
- [ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video☆74Updated last year
- Talking Head from Speech Audio using a Pre-trained Image Generator☆23Updated last year
- A novel apporach for personalized speech-driven 3D facial animation☆55Updated last year
- the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"☆105Updated last year
- The project page repo for Neural Dubber.☆30Updated 2 years ago
- ☆101Updated last month
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory☆33Updated 3 years ago
- The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)☆159Updated 8 months ago
- [ICASSP 2024] DiffDub: Person-generic visual dubbing using inpainting renderer with diffusion auto-encoder☆67Updated last year
- Official project repo for paper "Speech Driven Video Editing via an Audio-Conditioned Diffusion Model"☆229Updated 2 years ago
- wav2lip in a Vector Quantized (VQ) space☆27Updated 2 years ago
- ☆61Updated 5 months ago
- Project of "Adaptive Affine Transformation: A Simple and Effective Operation for Spatial Misaligned Image Generation"☆63Updated 2 years ago
- Source code for: Expressive Speech-driven Facial Animation with controllable emotions☆40Updated last year
- [ICIAP 2023] Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation☆62Updated last year
- Preprocessing Scipts for Talking Face Generation☆92Updated 10 months ago
- [AAAI 2024] stle2talker - Official PyTorch Implementation☆46Updated 4 months ago
- Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code☆109Updated 3 years ago
- Unoffical LivePortrait Training Script [ 🚧 Under Construction]☆35Updated 10 months ago
- ☆51Updated 4 months ago
- Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released☆11Updated 4 years ago
- Unofficial implementation of the paper "One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing" (CVPR 2021 Oral)☆173Updated 4 years ago
- code repo for LoCoNet: Long-Short Context Network for Active Speaker Detection☆46Updated 2 years ago