Jakobovski/free-spoken-digit-dataset

Jakobovski / free-spoken-digit-dataset

A free audio dataset of spoken digits. An audio version of MNIST.

☆678

Alternatives and similar repositories for free-spoken-digit-dataset

Users who are interested in free-spoken-digit-dataset are comparing it to the repositories listed below.

soerenab / AudioMNIST
☆374 · Jun 4, 2025 · Updated last year
mravanelli / pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…
☆2,399 · Mar 14, 2022 · Updated 4 years ago
isca-sig-rosp / ISCA-SIG-RoSP
Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)
☆11 · Dec 4, 2023 · Updated 2 years ago
castorini / honk
PyTorch implementations of neural network models for keyword spotting
☆526 · May 22, 2023 · Updated 3 years ago
YoavRamon / awesome-kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi (http://kaldi-asr.org/)
☆536 · Feb 9, 2022 · Updated 4 years ago
jameslyons / python_speech_features
This library provides common speech features for ASR including MFCCs and filterbank energies.
☆2,423 · Oct 20, 2021 · Updated 4 years ago
JRMeyer / easy-kaldi
Use your data to create a speech recognition system in Kaldi. Fast.
☆65 · Jan 2, 2020 · Updated 6 years ago
pykaldi / pykaldi
A Python wrapper for Kaldi
☆1,038 · Nov 30, 2025 · Updated 7 months ago
dobby-seo / Wav2Keyword
Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It reports state-of-the-art results on the Speech Commands dataset V1 and V2.
☆110 · Jan 11, 2023 · Updated 3 years ago
srinivr / kaldi-long-audio-alignment
Long audio alignment using Kaldi
☆23 · Apr 22, 2021 · Updated 5 years ago
gooofy / py-kaldi-asr
Some simple wrappers around kaldi-asr intended to make using Kaldi's (online) decoders as convenient as possible.
☆169 · Feb 23, 2021 · Updated 5 years ago
mindorii / kws
An End-to-End Architecture for Keyword Spotting and Voice Activity Detection
☆387 · Mar 24, 2023 · Updated 3 years ago
NeelayS / speech_spike_signatures
Spiking neural networks (SNNs) for speech classification
☆12 · Mar 14, 2022 · Updated 4 years ago
syhw / wer_are_we
An attempt at tracking the state of the art and recent results (bibliography) on speech recognition.
☆1,864 · Jun 27, 2022 · Updated 4 years ago
DemisEom / SpecAugment
An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain
☆655 · Apr 5, 2022 · Updated 4 years ago
facebookresearch / WavAugment
A library for speech data augmentation in time-domain
☆689 · Aug 30, 2021 · Updated 4 years ago
NTRLab / MediaSpeech
☆22 · Jul 22, 2022 · Updated 3 years ago
arenjansen / ZRTools
Zero-Resource Speech Discovery, Search, and Evaluation Tools
☆29 · Aug 6, 2015 · Updated 10 years ago
miras-tech / MirasVoice
MirasVoice is a data set consisting of speech samples from bilinguals to train neural network for optimization of speaker verification algor…
☆19 · Mar 15, 2020 · Updated 6 years ago
k2-fsa / k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
☆1,348 · Jul 11, 2026 · Updated last week
talhanai / kaldi-diar-latte
Steps to perform text-based speaker diarization with the Kaldi toolkit
☆12 · Nov 2, 2018 · Updated 7 years ago
ynop / audiomate
Python library for handling audio datasets.
☆139 · Jul 6, 2023 · Updated 3 years ago
pytorch / audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
☆2,915 · Updated this week
iver56 / audiomentations
A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.
☆2,302 · Apr 13, 2026 · Updated 3 months ago
ronggong / mispronunciation-detection
Mispronunciation detection code for jingju singing voice
☆19 · Sep 5, 2018 · Updated 7 years ago
csukuangfj / kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…
☆215 · Jul 10, 2026 · Updated last week
philipperemy / timit
The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.
☆330 · Mar 5, 2022 · Updated 4 years ago
jtkim-kaist / VAD
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
☆869 · Jun 9, 2021 · Updated 5 years ago
philipperemy / speaker-change-detection
Paper: https://arxiv.org/abs/1702.02285
☆64 · Dec 19, 2018 · Updated 7 years ago
bootphon / abkhazia
ABX and Kaldi experiments on speech corpora made easy
☆34 · Oct 7, 2024 · Updated last year
marsbroshok / VAD-python
Voice Activity Detector in Python
☆481 · Nov 17, 2020 · Updated 5 years ago
danpovey / pocolm
Small language toolkit for creation, interpolation and pruning of ARPA language models
☆92 · Aug 6, 2022 · Updated 3 years ago
sigsep / norbert
Painless Wiener filters for audio separation
☆191 · May 4, 2026 · Updated 2 months ago
ReScience / call-for-replication
Call for Replication in ReScience
☆13 · Oct 13, 2016 · Updated 9 years ago
desh2608 / kaldi-noise-vectors
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13 · Feb 13, 2021 · Updated 5 years ago
JRMeyer / multi-task-kaldi
An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…
☆55 · Jan 2, 2020 · Updated 6 years ago
MTG / freesound-datasets
A platform for the collaborative creation of open audio collections labeled by humans and based on Freesound content.
☆144 · May 18, 2026 · Updated 2 months ago
kaldi-asr / kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
☆15,432 · Sep 22, 2025 · Updated 9 months ago
sarahjuan / iban
☆14 · Jun 12, 2015 · Updated 11 years ago