julianyulu / SyncNetCN
Optimized SyncNet with a Chinese-enhanced version; EN and CN checkpoints released
☆13Updated 3 years ago
Alternatives and similar repositories for SyncNetCN:
Users that are interested in SyncNetCN are comparing it to the libraries listed below
- PersonaTalk Hack☆14Updated last month
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory☆33Updated 2 years ago
- wav2lip in a Vector Quantized (VQ) space☆28Updated last year
- ☆24Updated 3 years ago
- Reproduction of the new paper by the Wav2Lip authors☆20Updated last year
- ☆27Updated last year
- Project page for "Improving Few-shot Learning for Talking Face System with TTS Data Augmentation" for ICASSP2023☆85Updated last year
- [ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video☆67Updated 10 months ago
- ☆31Updated last year
- Splitting the ASR probability distribution results into Chinese pinyin, so as to extract more effective features for Chinese speech…☆21Updated last year
- High-resolution clothing swap, virtual try-on, and object replacement based on 4K video☆52Updated 2 years ago
- R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning☆80Updated last year
- All-in-one preprocessing tool for wav2lip training data☆40Updated last year
- One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior, CVPRW 2024☆60Updated 3 months ago
- simple and fast wav2lip using onnx models for face-detection and inference. Easy installation☆24Updated 4 months ago
- Talking head animation☆27Updated last year
- Something about Talking Head Generation☆32Updated last year
- ☆32Updated 3 years ago
- [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization☆41Updated 2 months ago
- DINet-based inference service for video streams and videos☆14Updated last year
- Speech-driven 3D Talking Heads Generation☆61Updated last year
- Code for "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces" ACM MM 2023☆30Updated last year
- Preprocessing Scripts for Talking Face Generation☆83Updated last month
- A novel approach for personalized speech-driven 3D facial animation☆45Updated 9 months ago
- ☆49Updated last year
- ☆27Updated last year
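- Ultra-fast wav2lip inference accelerated with RT, using models such as wav2lip, gfpgan, and yolov5. Tested at 0.03 seconds per frame on a 2070 GPU, achieving real-time inference.☆27Updated last year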
- Project of "Adaptive Affine Transformation: A Simple and Effective Operation for Spatial Misaligned Image Generation"☆61Updated last year
- Unofficial LivePortrait Training Script [🚧 Under Construction]☆19Updated 3 weeks ago
- Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices☆63Updated 10 months ago