common-voice / cv-dataset
Metadata and versioning details for the Common Voice dataset
☆146 · Updated 3 weeks ago
Alternatives and similar repositories for cv-dataset:
Users who are interested in cv-dataset are comparing it to the repositories listed below.
- Linguistic processing for Common Voice ☆55 · Updated last year
- Various speech datasets made available to the public ☆116 · Updated 4 months ago
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆310 · Updated 5 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆158 · Updated 2 weeks ago
- Segment an audio file and obtain utterance alignments. (Python package) ☆334 · Updated 11 months ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv … ☆137 · Updated 4 months ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model ☆107 · Updated 3 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages ☆133 · Updated last year
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling ☆191 · Updated 3 years ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus ☆194 · Updated 2 years ago
- A fast and lightweight Python-based CTC beam search decoder for speech recognition. ☆441 · Updated last year
- Variational Bayes HMM over x-vectors diarization ☆268 · Updated last year
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆82 · Updated last year
- Phoneme tokenizer and grapheme-to-phoneme model for 8k languages ☆157 · Updated last year
- PyTorch Implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech ☆230 · Updated 2 years ago
- Advanced data structures for handling temporal segments with attached labels. ☆111 · Updated 2 months ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram ☆245 · Updated 8 months ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation ☆530 · Updated 2 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆100 · Updated 2 months ago
- Speaker embedding (d-vector) trained with GE2E loss ☆280 · Updated last year
- Multilingual G2P in 100 languages ☆320 · Updated last year
- Diarization scoring tools. ☆240 · Updated 2 years ago
- Wav2Keyword is keyword spotting (KWS) based on Wav2Vec 2.0. This model shows state-of-the-art results on the Speech Commands dataset V1 and V2. ☆106 · Updated 2 years ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, … ☆286 · Updated 2 years ago
- Python library for handling audio datasets. ☆137 · Updated last year
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S… ☆405 · Updated last year
- Grapheme To Phoneme ☆71 · Updated 8 months ago
- Large, modern dataset for speech recognition ☆670 · Updated last year
- Neural HMMs are all you need (for high-quality attention-free TTS) ☆158 · Updated 2 weeks ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text ☆242 · Updated 5 years ago