A pipeline covering dataset gathering, data annotation, model training, and model evaluation for viseme (visual sound phoneme) classification
☆15Jan 19, 2021Updated 5 years ago
Alternatives and similar repositories for Viseme-Classification
Users that are interested in Viseme-Classification are comparing it to the libraries listed below.
- Code that generates phonemes from audio features.☆34Jun 15, 2021Updated 4 years ago
- ☆195Jul 15, 2021Updated 4 years ago
- VTuber application which only requires your voice and microphone, no need for a webcam or other tracking nonsense.☆18Jun 4, 2025Updated 11 months ago
- ☆97Jun 23, 2021Updated 4 years ago
- Tools for converting text to IPA in Python☆19Feb 11, 2023Updated 3 years ago
- CPU inference version of VisemeNet-tensorflow☆14Nov 6, 2019Updated 6 years ago
- JAX implementation of VQVAE/VQGAN autoencoders (+FSQ)☆41Jun 6, 2024Updated last year
- Bilingual Chinese and English G2P☆22Jul 16, 2023Updated 2 years ago
- Pytorch reimplementation of audio driven face mesh or blendshape models, including Audio2Mesh, VOCA, etc☆17Sep 6, 2024Updated last year
- ☆10May 30, 2024Updated last year
- USC CS621 Course Project☆26Apr 22, 2023Updated 3 years ago
- Lip animation app for 3D face models.☆27Sep 14, 2025Updated 8 months ago
- CLI tool for recording or replaying Epic Games' live link face capture frames.☆82Oct 13, 2023Updated 2 years ago
- C++ 11 minifloat type implementation☆14Aug 3, 2015Updated 10 years ago
- Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"☆28Jun 21, 2023Updated 2 years ago
- MMSE STSA Speech enhancement☆15Aug 24, 2015Updated 10 years ago
- ☆11Aug 9, 2022Updated 3 years ago
- Dynamically typed N-D expression system based on xtensor☆25Oct 20, 2021Updated 4 years ago
- Code for the paper EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models.☆16Mar 1, 2022Updated 4 years ago
- Unreal plugin with a CameraActor that captures RGB-D data and publishes it via TCP☆13Nov 1, 2024Updated last year
- a compact audio-to-phoneme aligner for singing voice☆12Jan 17, 2024Updated 2 years ago
- implementation based on "Audio-Driven Facial Animation by Joint End-to-End Learning of Pose and Emotion"☆164Apr 7, 2020Updated 6 years ago
- ☆48Aug 10, 2023Updated 2 years ago
- ☆20Oct 26, 2023Updated 2 years ago
- Audio2Face Avatar with Riva SDK functionality☆74Jan 2, 2023Updated 3 years ago
- ☆16Aug 22, 2017Updated 8 years ago
- benchmarking miopen☆17Jan 14, 2019Updated 7 years ago
- YOLO ported to the HiSilicon Hi3559☆13Sep 30, 2020Updated 5 years ago
- ECE 535 - Course Project, Deep Learning Framework☆76Jul 26, 2018Updated 7 years ago
- How to export Hugging Face's 🤗 NLP Transformers models to ONNX and use the exported model with the appropriate Transformers pipeline.☆25Apr 19, 2022Updated 4 years ago
- This is a labeling tool for Challenging Events for Person Detection from Overhead Fisheye Images (CEPDOF) fisheye dataset.☆12Oct 15, 2020Updated 5 years ago
- Paraformer (Chinese ASR) online ONNX runtime for Python☆54Mar 27, 2024Updated 2 years ago
- ☆42Jan 4, 2024Updated 2 years ago
- ☆15Feb 5, 2020Updated 6 years ago
- This is the official source for our ACM MM 2023 paper "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking …☆143Dec 5, 2023Updated 2 years ago
- Python implementation of the paper "Dynamic Temporal Alignment of Speech to Lips"☆32May 16, 2019Updated 7 years ago
- Parallel Genetic Algorithm Library originally by David Levine from Argonne National Laboratory☆24Jul 16, 2025Updated 10 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆13Jul 15, 2024Updated last year
- ☆25Jun 2, 2022Updated 3 years ago