lucaArrotta / Age-Estimation-based-on-Human-Voice
Human age estimation using deep neural networks (Keras)
☆12 · Updated last year
Alternatives and similar repositories for Age-Estimation-based-on-Human-Voice:
Users who are interested in Age-Estimation-based-on-Human-Voice are comparing it to the repositories listed below.
- Toolbox for easy and qualitative one-shot voice conversion ☆45 · Updated 3 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io ☆68 · Updated last year
- The Introduction of the OLKAVS Dataset ☆31 · Updated 10 months ago
- ☆33 · Updated last year
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E ☆138 · Updated 6 months ago
- Unsupervised Rhythm Modeling for Voice Conversion ☆81 · Updated last year
- PyTorch implementation of "Lip to Speech Synthesis in the Wild with Multi-task Learning" (ICASSP2023) ☆65 · Updated last year
- ☆65 · Updated 7 months ago
- How to use our public wav2vec2 age and gender model ☆39 · Updated last year
- Implementation of Emo-StarGAN ☆45 · Updated last year
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment) ☆119 · Updated 2 years ago
- ☆20 · Updated 3 years ago
- Singing Voice Conversion Challenge 2023 Starter Kit: FastSVC Reimplementation ☆114 · Updated last year
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021) ☆23 · Updated last year
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching ☆48 · Updated this week
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training ☆123 · Updated 2 years ago
- ☆65 · Updated last year
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer ☆74 · Updated last year
- ☆38 · Updated 7 months ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs ☆15 · Updated last year
- [Interspeech 2024] SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization ☆49 · Updated last month
- ☆21 · Updated 3 years ago
- ☆55 · Updated last year
- A curated list of speaker-embedding, speaker-verification, and speaker-identification resources. ☆47 · Updated 3 years ago
- An open-source Kazakh Emotional Text-to-Speech Dataset ☆28 · Updated last year
- Unsupervised WaveNet-based Singing Voice Conversion Using Pitch Augmentation and Two-phase Approach ☆69 · Updated 2 years ago
- ☆64 · Updated last year
- High-Fidelity Neural Phonetic Posteriorgrams ☆109 · Updated 2 months ago
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022) ☆116 · Updated 2 years ago
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech ☆11 · Updated last year