Magicboomliu / Viseme-Classification
A pipeline covering dataset gathering, data annotation, model training, and model evaluation for viseme (visual sound phoneme) classification
☆14Updated 4 years ago
Alternatives and similar repositories for Viseme-Classification
Users that are interested in Viseme-Classification are comparing it to the libraries listed below
- Code that generates phonemes from audio features.☆31Updated 4 years ago
- PyTorch reimplementation of audio-driven face mesh or blendshape models, including Audio2Mesh, VOCA, etc☆16Updated last year
- ☆12Updated 3 years ago
- 3D Avatar Lip Synchronization from speech (JALI based face-rigging)☆82Updated 3 years ago
- Splitting the ASR probability distribution results into Chinese pinyin, so as to extract more effective features for Chinese speech…☆21Updated 2 years ago
- Chinese text to facial expressions☆31Updated 3 years ago
- ☆15Updated last year
- ☆95Updated 4 years ago
- ☆34Updated 3 years ago
- SAiD: Blendshape-based Audio-Driven Speech Animation with Diffusion☆122Updated last year
- Speech to Facial Animation using GANs☆40Updated 4 years ago
- Audio-Visual Lip Synthesis via Intermediate Landmark Representation☆18Updated 2 years ago
- ☆28Updated 2 years ago
- ☆48Updated 2 years ago
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory☆33Updated 3 years ago
- lipsync is a simple and updated Python library for lip synchronization, based on Wav2Lip. It synchronizes lips in videos and images based…☆137Updated 9 months ago
- DINet-based inference service for video streams and videos☆17Updated 2 years ago
- ☆102Updated 2 weeks ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62Updated 2 years ago
- ☆34Updated 3 years ago
- Blender add-on to implement VOCA neural network.☆61Updated 3 years ago
- Drive your metahuman to speak within 1 second.☆12Updated 7 months ago
- Code for the project: "Audio-Driven Video-Synthesis of Personalised Moderations"☆20Updated last year
- Talking head animation☆28Updated last year
- 📖 A curated list of resources dedicated to avatar.☆60Updated last year
- Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar.☆87Updated 5 years ago
- Repo collection for NVIDIA Audio2Face-3D models and tools☆103Updated last month
- CPU inference version of VisemeNet-tensorflow☆14Updated 6 years ago
- Collections of papers, databases, and codes targeted at Digital Human☆42Updated last year
- Cloned repository from Hugging Face Spaces (CVPR 2022 Demo)☆53Updated 3 years ago