flyingshan / chinese_speech_feature_extractionView external linksLinks
Spliting the ASR probability distribution results into the chinese pinyin, so as to extract more effective feature for the chinese speech.
☆21Mar 16, 2023Updated 2 years ago
Alternatives and similar repositories for chinese_speech_feature_extraction
Users that are interested in chinese_speech_feature_extraction are comparing it to the libraries listed below
Sorting:
- ☆38Nov 10, 2024Updated last year
- Chroma key (green screen removal) algorithms with Python☆11Jul 14, 2024Updated last year
- Code for the paper "Free-View Expressive Talking Head Video Editing" (ICASSP 2023)☆12May 26, 2024Updated last year
- Talking head animation☆28Dec 8, 2023Updated 2 years ago
- ☆24Oct 8, 2021Updated 4 years ago
- Automatically generate a lip-synced avatar based off of a transcript and audio☆14Feb 17, 2023Updated 2 years ago
- 基于MuseTalk的数字人代码。☆35Sep 14, 2024Updated last year
- Python的音频工具☆16Dec 5, 2025Updated 2 months ago
- audiolm-pytorch training code☆15Jul 31, 2023Updated 2 years ago
- 基于DINet的推理服务,推理视频流和视频☆16Nov 8, 2023Updated 2 years ago
- PersonaTalk Hack☆15Jan 10, 2025Updated last year
- ☆18Jul 16, 2024Updated last year
- Audio-Visual Lip Synthesis via Intermediate Landmark Representation☆18May 16, 2023Updated 2 years ago
- ☆17Apr 3, 2017Updated 8 years ago
- Faster Talking Face Animation on Xeon CPU☆130Nov 14, 2023Updated 2 years ago
- This project fixes the Wav2Lip project so that it can run on Python 3.9. Wav2Lip is a project that can be used to lip-sync videos to audi…☆17Aug 31, 2023Updated 2 years ago
- Chinese and English Bilinguish G2P☆22Jul 16, 2023Updated 2 years ago
- 复现Wav2Lip作者新的论文☆20Jun 20, 2023Updated 2 years ago
- Unofficial implementation of the paper: StyleSwap: Style-Based Generator Empowers Robust Face Swapping☆52Oct 26, 2022Updated 3 years ago
- [ICIAP 2023] Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation☆62Dec 12, 2023Updated 2 years ago
- Cloned repository from Hugging Face Spaces (CVPR 2022 Demo)☆53Sep 29, 2022Updated 3 years ago
- Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars☆393Apr 8, 2025Updated 10 months ago
- Code for CVPR 2022 paper "Blind Face Restoration via Integrating Face Shape and Generative Priors"☆25Jan 4, 2023Updated 3 years ago
- HyperGaussians: High-Dimensional Gaussian Splatting for High-Fidelity Animatable Face Avatars☆38Jan 21, 2026Updated 3 weeks ago
- Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition☆927Apr 4, 2024Updated last year
- Wav2Lip version 288 and pipeline to train☆642Aug 13, 2025Updated 6 months ago
- ☆428Nov 1, 2023Updated 2 years ago
- Considering the original Wav2Lip was trained on LSR2 and didn't have good performance on Chinese. I preprocessed CMLR Dataset and would t…☆63Sep 23, 2023Updated 2 years ago
- ☆28Oct 1, 2023Updated 2 years ago
- [ECCV2022] The implementation for "Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis".☆343Jan 10, 2023Updated 3 years ago
- wav2lip in a Vector Quantized (VQ) space☆27Jun 20, 2023Updated 2 years ago
- Official source codes for the paper: EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing.☆37Jun 3, 2025Updated 8 months ago
- 中文到表情☆31May 12, 2022Updated 3 years ago
- [NeurIPS 2024] This is the official repo of the paper "Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Li…☆135Feb 9, 2025Updated last year
- ☆14Mar 12, 2023Updated 2 years ago
- 这是一个在wav2lip,使用wav2lip、gfpgan、yolov5等模型用RT加速的超快推理!经测试在2070显卡上可达到0.03秒每帧实现实时推理。☆31Sep 23, 2025Updated 4 months ago
- ☆30Jun 12, 2025Updated 8 months ago
- Chatbot with a 3D avatar that can answer interview questions in your behalf. It can speak and understand English, German and Albanian. Ba…☆40Nov 16, 2025Updated 3 months ago
- Native build of Google's webrtc library.☆35Dec 17, 2024Updated last year