A free audio dataset of spoken digits. An audio version of MNIST.
☆667May 2, 2024Updated last year
Alternatives and similar repositories for free-spoken-digit-dataset
Users that are interested in free-spoken-digit-dataset are comparing it to the libraries listed below
Sorting:
- ☆372Jun 4, 2025Updated 9 months ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,396Mar 14, 2022Updated 4 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Dec 4, 2023Updated 2 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆537Feb 9, 2022Updated 4 years ago
- PyTorch implementations of neural network models for keyword spotting☆525May 22, 2023Updated 2 years ago
- A Python wrapper for Kaldi☆1,030Nov 30, 2025Updated 3 months ago
- This library provides common speech features for ASR including MFCCs and filterbank energies.☆2,422Oct 20, 2021Updated 4 years ago
- Long audio alignment using Kaldi☆23Apr 22, 2021Updated 4 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆170Feb 23, 2021Updated 5 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆109Jan 11, 2023Updated 3 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Jan 2, 2020Updated 6 years ago
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection☆382Mar 24, 2023Updated 2 years ago
- Zero-Resource Speech Discovery, Search, and Evaluation Tools☆29Aug 6, 2015Updated 10 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Spiking neural networks (SNNs) for speech classification☆12Mar 14, 2022Updated 4 years ago
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.☆1,865Jun 27, 2022Updated 3 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- A library for speech data augmentation in time-domain☆683Aug 30, 2021Updated 4 years ago
- Python library for handling audio datasets.☆138Jul 6, 2023Updated 2 years ago
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆656Apr 5, 2022Updated 3 years ago
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…☆214Aug 7, 2025Updated 7 months ago
- FSA/FST algorithms, differentiable, with PyTorch compatibility.☆1,316Mar 9, 2026Updated last week
- ☆20Jul 22, 2022Updated 3 years ago
- Small language toolkit for creation, interpolation and pruning of ARPA language models☆92Aug 6, 2022Updated 3 years ago
- MirasVoice is a data set consisting speech samples from bilinguals to train neural network for optimization of speaker verification algor…☆19Mar 15, 2020Updated 6 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- ABX and kaldi experiments on speech corpora made easy☆33Oct 7, 2024Updated last year
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆55Jan 2, 2020Updated 6 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,240Dec 27, 2025Updated 2 months ago
- Painless Wiener filters for audio separation☆191Feb 17, 2022Updated 4 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆2,844Updated this week
- The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.☆323Mar 5, 2022Updated 4 years ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.☆869Jun 9, 2021Updated 4 years ago
- A platform for the collaborative creation of open audio collections labeled by humans and based on Freesound content.☆144Oct 6, 2023Updated 2 years ago
- Paper: https://arxiv.org/abs/1702.02285☆65Dec 19, 2018Updated 7 years ago
- Voice Activity Detector in Python☆480Nov 17, 2020Updated 5 years ago
- Benchmark popular audio i/o packages☆151Dec 19, 2023Updated 2 years ago
- Spoken language identification with deep learning☆233Jan 5, 2018Updated 8 years ago