Magicboomliu / Viseme-Classification
A pipeline covering dataset gathering, data annotation, model training, and model evaluation for viseme (visual sound phoneme) classification
☆12Updated 3 years ago
Related projects
Alternatives and complementary repositories for Viseme-Classification
- Code that generates phonemes from audio features.☆23Updated 3 years ago
- ☆11Updated 2 years ago
- Splitting the ASR probability distribution results into Chinese pinyin, so as to extract more effective features for the Chinese speech…☆21Updated last year
- Audio-Visual Lip Synthesis via Intermediate Landmark Representation☆14Updated last year
- ☆14Updated 2 months ago
- Chinese to facial expression☆26Updated 2 years ago
- DINet-based inference service for video streams and videos☆13Updated last year
- Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness (ICASSP 202…☆63Updated 9 months ago
- Music to Dance for 3D Avatar☆15Updated 3 years ago
- ☆96Updated 9 months ago
- CPU inference version of VisemeNet-tensorflow☆13Updated 5 years ago
- ☆27Updated last year
- Code for "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces" ACM MM 2023☆30Updated last year
- 3D Avatar Lip Synchronization from speech (JALI based face-rigging)☆73Updated 2 years ago
- Collections of papers, databases, and codes targeted at Digital Human☆29Updated 8 months ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆56Updated last year
- The ReprGesture entry to the GENEA Challenge 2022 (ICMI 2022)☆15Updated 2 years ago
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription.☆32Updated 8 months ago
- Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar.☆84Updated 4 years ago
- SAiD: Blendshape-based Audio-Driven Speech Animation with Diffusion☆89Updated 10 months ago
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory☆33Updated 2 years ago
- PyTorch implementation of "Lip to Speech Synthesis in the Wild with Multi-task Learning" (ICASSP2023)☆65Updated 8 months ago
- ☆91Updated 3 years ago
- ☆41Updated last year
- Talking head animation☆27Updated 11 months ago
- Simple and fast wav2lip using ONNX models for face detection and inference. Easy installation☆22Updated last month
- Code for the project: "Audio-Driven Video-Synthesis of Personalised Moderations"☆17Updated 9 months ago
- Aligns faces to the canonical face in both videos and images☆17Updated 2 years ago
- Official PyTorch implementation for "APB2Face: Audio-guided face reenactment with auxiliary pose and blink signals", ICASSP'20☆63Updated 3 years ago