Magicboomliu / Viseme-ClassificationLinks
A pipeline from Dataset Gathering,Data annotations, Model training,Model Evaluation for viseme (visual sound phoneme) classification
☆14Updated 4 years ago
Alternatives and similar repositories for Viseme-Classification
Users that are interested in Viseme-Classification are comparing it to the libraries listed below
Sorting:
- The code generate phoneme from audio features.☆31Updated 4 years ago
- Pytorch reimplementation of audio driven face mesh or blendshape models, including Audio2Mesh, VOCA, etc☆16Updated last year
- 3D Avatar Lip Synchronization from speech (JALI based face-rigging)☆82Updated 3 years ago
- ☆12Updated 3 years ago
- ☆15Updated last year
- Audio2Face Avatar with Riva SDK functionality☆74Updated 2 years ago
- lipsync is a simple and updated Python library for lip synchronization, based on Wav2Lip. It synchronizes lips in videos and images based…☆135Updated 9 months ago
- ☆95Updated 4 years ago
- SAiD: Blendshape-based Audio-Driven Speech Animation with Diffusion☆122Updated last year
- Speech to Facial Animation using GANs☆40Updated 3 years ago
- Spliting the ASR probability distribution results into the chinese pinyin, so as to extract more effective feature for the chinese speech…☆21Updated 2 years ago
- ☆102Updated last month
- 中文到表情☆31Updated 3 years ago
- ☆28Updated 2 years ago
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory☆33Updated 2 years ago
- Code for "Animating Portrait Line Drawings from a Single Face Photo and a Speech Signal"☆57Updated 3 years ago
- ☆34Updated 3 years ago
- repo collection for NVIDIA Audio2Face-3D models and tools☆79Updated last month
- Drive your metahuman to speak within 1 second.☆12Updated 7 months ago
- 基于DINet的推理服务,推理视频流和视频☆16Updated last year
- Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar.☆86Updated 5 years ago
- ☆34Updated 3 years ago
- PyTorch implementation of NEUTART, a system that creates photorealistic talking avatars from an input text transcription.☆34Updated 7 months ago
- Web-first SDK that provides real-time ARKit-compatible 52 blend shapes from a camera feed, video or image at 60 FPS using ML models.☆87Updated 2 years ago
- Audio-Visual Lip Synthesis via Intermediate Landmark Representation☆18Updated 2 years ago
- Music to Dance for 3D Avatar☆16Updated 3 years ago
- Cloned repository from Hugging Face Spaces (CVPR 2022 Demo)☆53Updated 3 years ago
- 📖 A curated list of resources dedicated to avatar.☆60Updated 11 months ago
- Code for ACCV 2020 "Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses"☆100Updated 4 years ago
- Faster Talking Face Animation on Xeon CPU☆129Updated last year