common-voice / cv-dataset
Metadata and versioning details for the Common Voice dataset
☆142Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for cv-dataset
- UniSpeech - Large Scale Self-Supervised Learning for Speech☆434Updated 7 months ago
- Linguistic processing for Common Voice☆52Updated 10 months ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆224Updated 2 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆321Updated 6 months ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, …☆283Updated last year
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆512Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages.☆285Updated last week
- Variational Bayes HMM over x-vectors diarization☆254Updated 10 months ago
- VCTK multi-speaker tacotron for ICASSP 2020☆265Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- Neural HMMs are all you need (for high-quality attention-free TTS)☆157Updated 3 weeks ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆138Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆64Updated 3 years ago
- Various speech datasets made available to the public☆99Updated 2 months ago
- Large, modern dataset for speech recognition☆649Updated 8 months ago
- 🐸TTS recipes for different datasets☆85Updated 2 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆235Updated 5 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆100Updated last year
- Command line tool to create corpora for Common Voice☆75Updated 5 months ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆106Updated 3 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆273Updated 10 months ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…☆324Updated 2 years ago
- Multilingual G2P in 100 languages☆288Updated last year
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆183Updated 2 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆128Updated 3 years ago
- A pure python module for reading and writing kaldi ark files☆249Updated last year
- Collection of pretrained models for the Montreal Forced Aligner☆116Updated 4 months ago
- A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…☆144Updated 2 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆340Updated 2 years ago
- A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf☆363Updated 3 years ago