flyingshan/chinese_speech_feature_extraction

Splitting the ASR probability distribution results into Chinese pinyin, so as to extract more effective features for Chinese speech.

☆21

Alternatives and similar repositories for chinese_speech_feature_extraction

Users who are interested in chinese_speech_feature_extraction are comparing it to the libraries listed below.

eugeneteoh / chromakey
Chroma key (green screen removal) algorithms with Python
☆10 · Jul 14, 2024 · Updated 2 years ago
mkara44 / liveportrait_talker
☆39 · Nov 10, 2024 · Updated last year
EternalDusk / LipSyncVideoGenerator
Automatically generate a lip-synced avatar based off of a transcript and audio
☆15 · Feb 17, 2023 · Updated 3 years ago
Kafeyun / Wav2Lip-Ultra
Reproduction of the new paper by the Wav2Lip authors
☆20 · Jun 20, 2023 · Updated 3 years ago
sky24h / Free-View_Expressive_Talking_Head_Video_Editing
Code for the paper "Free-View Expressive Talking Head Video Editing" (ICASSP 2023)
☆12 · May 26, 2024 · Updated 2 years ago
peterwisu / lip-synthesis
Audio-Visual Lip Synthesis via Intermediate Landmark Representation
☆19 · May 16, 2023 · Updated 3 years ago
Holasyb918 / PersonaTalk_Hack
PersonaTalk Hack
☆16 · Jan 10, 2025 · Updated last year
USTC3DV / NeRFBlendShape-code
☆223 · Aug 12, 2023 · Updated 2 years ago
ashawkey / RAD-NeRF
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition
☆927 · Apr 4, 2024 · Updated 2 years ago
julianyulu / Wav2LipHD
☆24 · Oct 8, 2021 · Updated 4 years ago
neeek2303 / EMOPortraits
Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars
☆397 · Apr 8, 2025 · Updated last year
yoonhachoe / FaceReenactment
Facial Reenactment from Sparse Landmarks using StyleGAN3
☆11 · Aug 18, 2024 · Updated last year
sstzal / DFRF
[ECCV2022] The implementation for "Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis".
☆338 · Jan 10, 2023 · Updated 3 years ago
Spycsh / xtalker
Faster Talking Face Animation on Xeon CPU
☆127 · Nov 14, 2023 · Updated 2 years ago
LWprogramming / audiolm-pytorch-training
audiolm-pytorch training code
☆15 · Jul 31, 2023 · Updated 2 years ago
IronSpiderMan / MuseTalkPlus
Digital human code based on MuseTalk.
☆33 · Sep 14, 2024 · Updated last year
tambetm / face_kiosk
☆17 · Apr 3, 2017 · Updated 9 years ago
MingtaoGuo / StyleSwap
Unofficial implementation of the paper: StyleSwap: Style-Based Generator Empowers Robust Face Swapping
☆51 · Oct 26, 2022 · Updated 3 years ago
MoayedHajiAli / VidStyleODE-official
☆18 · Jul 16, 2024 · Updated 2 years ago
deepkyu / ml-talking-face
Cloned repository from Hugging Face Spaces (CVPR 2022 Demo)
☆54 · Sep 29, 2022 · Updated 3 years ago
FedeNoce / s2l-s2d
[ICIAP 2023] Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation
☆59 · Dec 12, 2023 · Updated 2 years ago
primepake / wav2lip_288x288
Wav2Lip version 288 and pipeline to train
☆648 · Aug 13, 2025 · Updated 11 months ago
HassanMuhammadSannaullah / Wav2lip-Fix-For-Inference
This project fixes the Wav2Lip project so that it can run on Python 3.9. Wav2Lip is a project that can be used to lip-sync videos to audi…
☆17 · Aug 31, 2023 · Updated 2 years ago
zzc-1998 / GMS-3DQA
Official repo for "GMS-3DQA: Projection-based Grid Mini-patch Sampling for 3D Model Quality Assessment"
☆14 · Mar 10, 2024 · Updated 2 years ago
asdMild / chinese-audio2face
Chinese to facial expressions
☆30 · May 12, 2022 · Updated 4 years ago
Sxjdwang / TalkLip
☆429 · Nov 1, 2023 · Updated 2 years ago
zhangnn520 / digitalAvatarRealtime
DINet-based inference service for video streams and videos
☆17 · Nov 8, 2023 · Updated 2 years ago
Smorodov / nano_bfm
Basel morphable face model mesh and texture generator using GPU.
☆14 · Sep 14, 2020 · Updated 5 years ago
jadewu / 3D-Human-Face-Reconstruction-with-3DMM-face-model-from-RGB-image
Reconstruct 3D model from 2D human face images and CNN based PCA generation.
☆45 · Jul 24, 2021 · Updated 5 years ago
yeyupiaoling / YeAudio
Audio tools for Python
☆16 · Dec 5, 2025 · Updated 7 months ago
SeanWangJS / grid-sample3d-trt-plugin
TensorRT plugin for 3-dimension grid sample operator
☆52 · Feb 26, 2025 · Updated last year
yunik1004 / SAiD
SAiD: Blendshape-based Audio-Driven Speech Animation with Diffusion
☆135 · Jan 25, 2024 · Updated 2 years ago
yufan1012 / MonoGaussianAvatar
☆145 · Sep 27, 2024 · Updated last year
zzj1111 / Preprocessed-CMLR-Dataset-For-Wav2Lip
Considering the original Wav2Lip was trained on LRS2 and didn't have good performance on Chinese, I preprocessed the CMLR dataset and would t…
☆63 · Sep 23, 2023 · Updated 2 years ago
neopenx / Facial-Expression
Facial-Expression Recognition with Deep Neural Networks
☆10 · Mar 6, 2016 · Updated 10 years ago
fzinfz / book
Tech notes for mkdocs and gitbook
☆18 · Jul 16, 2026 · Updated last week
wujinzhong / Wav2Lip_TensorRT
☆29 · Oct 1, 2023 · Updated 2 years ago
jesonxiang / cpp_extension_pybind11
A demo project demonstrating the performance improvement from a C++ extension wrapped with pybind11.
☆10 · Nov 16, 2021 · Updated 4 years ago
TencentYoutuResearch / FaceRestoration-sgpn
Code for CVPR 2022 paper "Blind Face Restoration via Integrating Face Shape and Generative Priors"
☆25 · Jan 4, 2023 · Updated 3 years ago