lucaArrotta / Age-Estimation-based-on-Human-Voice
Human age estimation using deep neural networks (Keras)
☆12 · Updated last year
Alternatives and similar repositories for Age-Estimation-based-on-Human-Voice:
Users who are interested in Age-Estimation-based-on-Human-Voice are comparing it to the repositories listed below.
- An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io ☆68 · Updated last year
- ☆64 · Updated 6 months ago
- Toolbox for easy and qualitative one-shot voice conversion ☆45 · Updated 3 years ago
- PyTorch implementation of "Lip to Speech Synthesis in the Wild with Multi-task Learning" (ICASSP2023) ☆65 · Updated last year
- ☆64 · Updated last year
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2 ☆51 · Updated last year
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",… ☆79 · Updated last year
- ☆33 · Updated last year
- ☆69 · Updated last year
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech ☆234 · Updated last year
- Official PyTorch Implementation of "Diff-HierVC: Diffusion-based Hierarchical Voice Conversion with Robust Pitch Generation and Masked Pr… ☆213 · Updated 8 months ago
- Implementation of Emo-StarGAN ☆45 · Updated last year
- An open-source Kazakh Emotional Text-to-Speech Dataset ☆27 · Updated 11 months ago
- Unsupervised Rhythm Modeling for Voice Conversion ☆80 · Updated last year
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment) ☆120 · Updated 2 years ago
- dog-can-sing-song ☆22 · Updated 5 months ago
- Singing Voice Conversion Challenge 2023 Starter Kit: FastSVC Reimplementation ☆113 · Updated last year
- The official implementation of EmoSphere++ ☆80 · Updated 2 weeks ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training ☆122 · Updated 2 years ago
- ☆140 · Updated last year
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021) ☆23 · Updated last year
- ☆115 · Updated 2 years ago
- Joint training of a speaker encoder with a consistency loss to achieve cross-lingual voice conversion and expressive voice conversion ☆144 · Updated last year
- Create training data for training a voice cloner for Bark text-to-speech. ☆44 · Updated last year
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion ☆145 · Updated last year
- How to use our public wav2vec2 age and gender model ☆39 · Updated last year
- ☆21 · Updated 3 years ago
- Diffusion Singing Voice Conversion based on Grad-TTS from Huawei ☆144 · Updated last year
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E ☆137 · Updated 5 months ago
- The Introduction of the OLKAVS Dataset ☆31 · Updated 10 months ago