flyingshan / chinese_speech_feature_extraction
Splitting the ASR probability distribution results into Chinese pinyin, so as to extract more effective features for Chinese speech.
☆21Updated 2 years ago
Alternatives and similar repositories for chinese_speech_feature_extraction
Users that are interested in chinese_speech_feature_extraction are comparing it to the libraries listed below
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory☆33Updated 2 years ago
- wav2lip in a Vector Quantized (VQ) space☆28Updated 2 years ago
- ☆28Updated last year
- 📖 A curated list of resources dedicated to avatar.☆59Updated 8 months ago
- R2-Talker: Realistic Real-Time Talking Head Synthesis with Hash Grid Landmarks Encoding and Progressive Multilayer Conditioning☆80Updated last year
- Something about Talking Head Generation☆32Updated last year
- ☆24Updated 3 years ago
- PersonaTalk Hack☆13Updated 6 months ago
- Wav2Lip-Emotion extends Wav2Lip to modify facial expressions of emotions via L1 reconstruction and pre-trained emotion objectives. We als…☆96Updated 3 years ago
- Aims to accelerate image-animation-model inference through inference frameworks such as ONNX, TensorRT, and OpenVINO.☆76Updated last year
- Reproduction of the new paper by the Wav2Lip authors☆20Updated 2 years ago
- A Real-Time High-Definition Teeth Restoration Network for Arbitrary Talking Face Generation Methods☆141Updated last year
- Audio-Visual Lip Synthesis via Intermediate Landmark Representation☆18Updated 2 years ago
- ☆123Updated last year
- Psyche AI Inc source release of "CVCUDA_FaceStoreHelper"☆67Updated 2 years ago
- ☆34Updated 3 years ago
- Faster Talking Face Animation on Xeon CPU☆130Updated last year
- ☆32Updated last year
- PyTorch implementation of "StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator"☆213Updated last year
- ☆73Updated 2 years ago
- [ICIAP 2023] Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation☆62Updated last year
- One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior, CVPRW 2024☆61Updated 8 months ago
- An optimized pipeline for DINet reducing inference latency by up to 60% 🚀. Kudos to the authors of the original repo for this amazing …☆107Updated last year
- This is a project about talking faces. We use 576×576 facial images for training, which can generate 2k, 4k, 6k, and 8k digital hum…☆54Updated last year
- Comprehensive preprocessing tool for wav2lip training data☆40Updated last year
- Speech-Driven Expression Blendshape Based on Single-Layer Self-attention Network (AIWIN 2022)☆76Updated 2 years ago
- Official PyTorch implementation of our paper "HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation".☆209Updated last year
- DINet-based inference service for video streams and videos☆16Updated last year
- ☆8Updated last year
- Just a suturing monster project.☆41Updated last year