common-voice / cv-dataset
Metadata and versioning details for the Common Voice dataset
☆145Updated 2 months ago
Alternatives and similar repositories for cv-dataset:
Users that are interested in cv-dataset are comparing it to the libraries listed below
- Linguistic processing for Common Voice☆53Updated last year
- Various speech datasets made available to the public☆113Updated 2 months ago
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech☆229Updated 2 years ago
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, …☆285Updated last year
- Command line tool to create corpora for Common Voice☆75Updated 8 months ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆132Updated 10 months ago
- The People’s Speech Dataset☆101Updated last year
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆103Updated 2 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆328Updated 9 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆147Updated last month
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …☆130Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database☆96Updated last week
- Collection of pretrained models for the Montreal Forced Aligner☆130Updated 7 months ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆190Updated 3 years ago
- Variational Bayes HMM over x-vectors diarization☆263Updated last year
- UniSpeech - Large Scale Self-Supervised Learning for Speech☆449Updated 10 months ago
- ☆39Updated last year
- Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset☆155Updated 5 years ago
- Multilingual G2P in 100 languages☆299Updated last year
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation☆38Updated 2 years ago
- ☆90Updated 2 years ago
- ☆66Updated 2 months ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆172Updated 2 months ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆154Updated last year
- Predicts the level of noise and reverberation on your audiofiles☆144Updated 9 months ago
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation☆526Updated last year
- a curated list of speech datasets (110+ datasets, 75+ easy to download)☆122Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆303Updated 3 months ago
- A non-native English corpus for pronunciation scoring task☆123Updated 7 months ago