Metadata and versioning details for the Common Voice dataset
☆167Feb 16, 2026Updated 2 weeks ago
Alternatives and similar repositories for cv-dataset
Users that are interested in cv-dataset are comparing it to the libraries listed below
Sorting:
- Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language☆11Apr 13, 2023Updated 2 years ago
- Linguistic processing for Common Voice☆58Jan 18, 2024Updated 2 years ago
- Tool to collect and review sentences for Common Voice☆82May 10, 2023Updated 2 years ago
- Transformer-based visually grounded speech models☆19Sep 22, 2022Updated 3 years ago
- Scraping Wikipedia for fair use sentences☆54Jan 25, 2024Updated 2 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- Word Discovery in Visually Grounded, Self-Supervised Speech Models☆26Dec 4, 2023Updated 2 years ago
- Unsupervised phone and word segmentation using dynamic programming on self-supervised VQ features.☆39Mar 4, 2024Updated 2 years ago
- Tools for Ahocoder data processing and evaluation metrics☆15Apr 22, 2024Updated last year
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆60Oct 19, 2022Updated 3 years ago
- Mozilla Voice Community Playbook☆48May 21, 2024Updated last year
- Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"☆28Feb 22, 2022Updated 4 years ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Apr 1, 2021Updated 4 years ago
- ☆229Nov 13, 2023Updated 2 years ago
- Bayesian spEEch Recognizer☆55Jan 11, 2021Updated 5 years ago
- Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.☆223Oct 20, 2023Updated 2 years ago
- ☆55Aug 11, 2022Updated 3 years ago
- Network specification and demo☆35Jun 5, 2017Updated 8 years ago
- Large, modern dataset for speech recognition☆721Feb 26, 2024Updated 2 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- Massively multilingual pronunciation mining☆362Jan 13, 2026Updated last month
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆368Oct 12, 2021Updated 4 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆42Feb 4, 2026Updated last month
- These are Jupyter Notebooks to help guide people to learn how to use Praat-Parselmouth☆42Sep 29, 2021Updated 4 years ago
- This is project for korean auto spacing☆12Aug 3, 2020Updated 5 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- AI model designed to test the effectiveness in handling external ethical attacks.☆11Feb 9, 2026Updated 3 weeks ago
- 为visinger SVS系统写的展示系统~本质仍然是个音乐播放器☆11Apr 18, 2023Updated 2 years ago
- Paper Review about Speech Recognition · NLP☆10Mar 25, 2021Updated 4 years ago
- List of speech synthesis papers.☆1,067Jul 24, 2023Updated 2 years ago
- A crash course for training speech recognition models using DeepSpeech.☆24May 16, 2021Updated 4 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆321Jul 25, 2024Updated last year
- Command line tool to create corpora for Common Voice☆78Feb 16, 2026Updated 2 weeks ago
- Multilingual G2P in 100 languages☆375May 26, 2023Updated 2 years ago
- High-level API for tar-based dataset☆12Feb 3, 2024Updated 2 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- Shan Natural Language Processing tools inspired by PythaiNLP☆14Updated this week
- Deploy KoGPT with Triton Inference Server☆14Nov 18, 2022Updated 3 years ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆53Nov 1, 2022Updated 3 years ago