Metadata and versioning details for the Common Voice dataset
☆171Apr 10, 2026Updated last month
Alternatives and similar repositories for cv-dataset
Users that are interested in cv-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language☆11Apr 13, 2023Updated 3 years ago
- Linguistic processing for Common Voice☆59Jan 18, 2024Updated 2 years ago
- Tool to collect and review sentences for Common Voice☆83May 10, 2023Updated 3 years ago
- Scraping Wikipedia for fair use sentences☆54Jan 25, 2024Updated 2 years ago
- Transformer-based visually grounded speech models☆19Sep 22, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A voice driven 3D chess game for learning Voice AI☆17Jul 6, 2022Updated 3 years ago
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.☆39May 5, 2026Updated 3 weeks ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆27Dec 4, 2023Updated 2 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"☆28Feb 22, 2022Updated 4 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆41Feb 4, 2026Updated 3 months ago
- These are Jupyter Notebooks to help guide people to learn how to use Praat-Parselmouth☆43Sep 29, 2021Updated 4 years ago
- I wanted guided tutorials on digital signal processing so I decided to create them. The result is this ebook: "Digital Signal Processing …☆12Feb 5, 2024Updated 2 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Oct 19, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Apr 1, 2021Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆16Mar 26, 2022Updated 4 years ago
- Few-Shot Keyword Spotting☆73Apr 11, 2021Updated 5 years ago
- Mozilla Voice Community Playbook☆48May 21, 2024Updated 2 years ago
- Common Voice is part of Mozilla's initiative to help teach machines how real people speak.☆3,468May 22, 2026Updated last week
- High-level API for tar-based dataset☆12Feb 3, 2024Updated 2 years ago
- Command line tool to create corpora for Common Voice☆78Mar 25, 2026Updated 2 months ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆10Feb 22, 2022Updated 4 years ago
- AsoSoft Speech Corpus can be used for spoken language processing tasks in Central Kurdish such as speech recognition, speaker recognition…☆10Mar 8, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Extensions to YAML syntax for better python interaction☆80Jan 1, 2026Updated 4 months ago
- Massively multilingual pronunciation mining☆367May 22, 2026Updated last week
- Mycroft's multilingual text parsing and formatting library☆78Aug 14, 2023Updated 2 years ago
- ☆55Aug 11, 2022Updated 3 years ago
- Bayesian spEEch Recognizer☆55Jan 11, 2021Updated 5 years ago
- ☆231Nov 13, 2023Updated 2 years ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Jun 6, 2025Updated 11 months ago
- Gestion des activités de la communauté MozFR.☆31Jan 11, 2026Updated 4 months ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- VAE Tacotron 2, an alternative of GST Tacotron☆91Jul 6, 2023Updated 2 years ago
- Large, modern dataset for speech recognition☆726Feb 26, 2024Updated 2 years ago
- DEPRECATED - A crash course for training speech recognition models using DeepSpeech.☆24May 16, 2021Updated 5 years ago
- List of speech synthesis papers.☆1,071Jul 24, 2023Updated 2 years ago
- An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…☆414Aug 29, 2023Updated 2 years ago
- Multilingual G2P in 100 languages☆384May 26, 2023Updated 3 years ago
- These are various scripts to manipulate and/or measure the acoustic properties of speech sounds☆15Oct 18, 2024Updated last year