julianyulu / SyncNetCNLinks
Optimized Syncnet and Chinese enhanced version, EN and CN checkpoints released
☆11Updated 4 years ago
Alternatives and similar repositories for SyncNetCN
Users that are interested in SyncNetCN are comparing it to the libraries listed below
Sorting:
- PersonaTalk Hack☆14Updated 11 months ago
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory☆33Updated 3 years ago
- wav2lip in a Vector Quantized (VQ) space☆27Updated 2 years ago
- 复现Wav2Lip作者新的论文☆20Updated 2 years ago
- ☆24Updated 4 years ago
- Spliting the ASR probability distribution results into the chinese pinyin, so as to extract more effective feature for the chinese speech…☆21Updated 2 years ago
- 实现基于4k视频的高分辨率人物换衣、虚拟试穿、物品替换☆56Updated 3 years ago
- Wav2Lip-Emotion extends Wav2Lip to modify facial expressions of emotions via L1 reconstruction and pre-trained emotion objectives. We als…☆96Updated 3 years ago
- Something about Talking Head Generation☆32Updated 2 years ago
- [ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video☆74Updated last year
- R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning☆82Updated last year
- [ICIAP 2023] Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation☆62Updated last year
- ☆101Updated last month
- Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023☆86Updated 2 years ago
- An optimized pipeline for DINet reducing inference latency for up to 60% 🚀. Kudos for the authors of the original repo for this amazing …☆109Updated 2 years ago
- Considering the original Wav2Lip was trained on LSR2 and didn't have good performance on Chinese. I preprocessed CMLR Dataset and would t…☆62Updated 2 years ago
- ☆49Updated 2 years ago
- 基于DINet的推理服务,推理视频流和视频☆16Updated 2 years ago
- PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"☆213Updated 2 years ago
- The MAVD represents Mandarin Audio-Visual dataset with Depth information. MAVD has a rich variety of modal data, including audio, RGB ima…☆20Updated last year
- One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior, CVPRW 2024☆65Updated last year
- ☆28Updated 2 years ago
- Audio-Visual Lip Synthesis via Intermediate Landmark Representation☆18Updated 2 years ago
- wav2lip训练数据预处理综合工具☆39Updated 2 years ago
- ☆15Updated last year
- ☆26Updated 2 years ago
- Psyche AI Inc release source "CVCUDA_FaceStoreHelper"☆66Updated 2 years ago
- ☆34Updated 3 years ago
- ☆27Updated 2 years ago
- Preprocessing Scipts for Talking Face Generation☆92Updated 10 months ago