common-voice/cv-dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/common-voice/cv-dataset)

common-voice / cv-dataset

Metadata and versioning details for the Common Voice dataset

☆173

Alternatives and similar repositories for cv-dataset

Users that are interested in cv-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

common-voice / common-voice-bundler
View on GitHub
Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language
☆11Apr 13, 2023Updated 3 years ago
common-voice / sentence-collector
View on GitHub
Tool to collect and review sentences for Common Voice
☆83May 10, 2023Updated 3 years ago
ftyers / commonvoice-utils
View on GitHub
Linguistic processing for Common Voice
☆59Jan 18, 2024Updated 2 years ago
common-voice / cv-sentence-extractor
View on GitHub
Scraping Wikipedia for fair use sentences
☆54Jan 25, 2024Updated 2 years ago
jasonppy / FaST-VGS-Family
View on GitHub
Transformer-based visually grounded speech models
☆19Sep 22, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
HarikalarKutusu / 3d-voice-chess
View on GitHub
A voice driven 3D chess game for learning Voice AI
☆17Jul 6, 2022Updated 4 years ago
kamperh / vqwordseg
View on GitHub
Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.
☆39May 5, 2026Updated 2 months ago
zkmkarlsruhe / language-identification
View on GitHub
Spoken Language Identification on Common Voice and AudioSet using Deep Learning
☆42Feb 4, 2026Updated 5 months ago
wnhsu / ResDAVEnet-VQ
View on GitHub
Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"
☆28Feb 22, 2022Updated 4 years ago
drfeinberg / Parselmouth-Guides
View on GitHub
These are Jupyter Notebooks to help guide people to learn how to use Praat-Parselmouth
☆43Sep 29, 2021Updated 4 years ago
zerospeech / zerospeech2021_baseline
View on GitHub
BERT and LSTM baseline models of the ZeroSpeech Challenge 2021
☆60Oct 19, 2022Updated 3 years ago
megseekosh / dsp_tutorials
View on GitHub
I wanted guided tutorials on digital signal processing so I decided to create them. The result is this ebook: "Digital Signal Processing …
☆12Feb 5, 2024Updated 2 years ago
ryanrudes / YTTTS
View on GitHub
The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions
☆53Apr 1, 2021Updated 5 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
ArchitParnami / Few-Shot-KWS
View on GitHub
Few-Shot Keyword Spotting
☆73Apr 11, 2021Updated 5 years ago
jasonppy / word-discovery
View on GitHub
Word Discovery in Visually Grounded, Self-Supervised Speech Models
☆27Dec 4, 2023Updated 2 years ago
common-voice / community-playbook
View on GitHub
Mozilla Voice Community Playbook
☆48May 21, 2024Updated 2 years ago
common-voice / common-voice
View on GitHub
Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
☆3,475Updated this week
npuichigo / tarzan
View on GitHub
High-level API for tar-based dataset
☆12Feb 3, 2024Updated 2 years ago
common-voice / CorporaCreator
View on GitHub
Command line tool to create corpora for Common Voice
☆78Mar 25, 2026Updated 4 months ago
chorowski-lab / CPC_audio
View on GitHub
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
☆10Feb 22, 2022Updated 4 years ago
AsoSoft / AsoSoft-Speech-Corpus
View on GitHub
AsoSoft Speech Corpus can be used for spoken language processing tasks in Central Kurdish such as speech recognition, speaker recognition…
☆10Mar 8, 2022Updated 4 years ago
speechbrain / HyperPyYAML
View on GitHub
Extensions to YAML syntax for better python interaction
☆80Jan 1, 2026Updated 6 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
MycroftAI / lingua-franca
View on GitHub
Mycroft's multilingual text parsing and formatting library
☆78Aug 14, 2023Updated 2 years ago
So-Fann / VISinger
View on GitHub
☆55Aug 11, 2022Updated 3 years ago
michellecohn / praat-scripts
View on GitHub
These are various scripts to manipulate and/or measure the acoustic properties of speech sounds
☆15Oct 18, 2024Updated last year
mozfr / besogne
View on GitHub
Gestion des activités de la communauté MozFR.
☆31Jan 11, 2026Updated 6 months ago
JaesungHuh / av-diarization
View on GitHub
Audio-visual diarization pipeline used for creating VoxConverse dataset
☆22Jun 6, 2025Updated last year
CUNY-CL / wikipron
View on GitHub
Massively multilingual pronunciation mining
☆371Updated this week
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
rishikksh20 / vae_tacotron2
View on GitHub
VAE Tacotron 2, an alternative of GST Tacotron
☆91Jul 6, 2023Updated 3 years ago
mozilla / deepspeech-playbook
View on GitHub
DEPRECATED - A crash course for training speech recognition models using DeepSpeech.
☆24May 16, 2021Updated 5 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
sarulab-speech / jtubespeech
View on GitHub
☆233Nov 13, 2023Updated 2 years ago
ajk77 / SimpleEMRSystem
View on GitHub
The Simple EMR System is a rapidly deployable and readily customizable electronic medical record (EMR) user interface for supporting labo…
☆20Dec 8, 2025Updated 7 months ago
common-voice / commonvoice-fr
View on GitHub
Tooling for producing French dataset for Common Voice
☆101Jan 20, 2025Updated last year
pgys / NoIze
View on GitHub
A selective noise filter architecture driven by a CNN and Wiener filter
☆17Nov 21, 2019Updated 6 years ago
SpeechColab / GigaSpeech
View on GitHub
Large, modern dataset for speech recognition
☆731Feb 26, 2024Updated 2 years ago
jqueguiner / spleeter-as-a-service
View on GitHub
API implementation of Song Source spleeting from Spleeter by Deezer
☆13Mar 21, 2020Updated 6 years ago
wenet-e2e / speech-synthesis-paper
View on GitHub
List of speech synthesis papers.
☆1,074Jul 24, 2023Updated 3 years ago