A free audio dataset of spoken digits. An audio version of MNIST.
☆677May 2, 2024Updated 2 years ago
Alternatives and similar repositories for free-spoken-digit-dataset
Users that are interested in free-spoken-digit-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆372Jun 4, 2025Updated last year
- A Convolutional Neural Network to identify spoken digits.☆52Mar 13, 2022Updated 4 years ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,399Mar 14, 2022Updated 4 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Dec 4, 2023Updated 2 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆538Feb 9, 2022Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- PyTorch implementations of neural network models for keyword spotting☆524May 22, 2023Updated 3 years ago
- A Python wrapper for Kaldi☆1,034Nov 30, 2025Updated 6 months ago
- This is the repository for my version of Kaldi for Dummies example.☆17Nov 18, 2018Updated 7 years ago
- This library provides common speech features for ASR including MFCCs and filterbank energies.☆2,423Oct 20, 2021Updated 4 years ago
- Long audio alignment using Kaldi☆23Apr 22, 2021Updated 5 years ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.☆169Feb 23, 2021Updated 5 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆110Jan 11, 2023Updated 3 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Jan 2, 2020Updated 6 years ago
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection☆386Mar 24, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Zero-Resource Speech Discovery, Search, and Evaluation Tools☆29Aug 6, 2015Updated 10 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.☆1,862Jun 27, 2022Updated 3 years ago
- Spiking neural networks (SNNs) for speech classification☆12Mar 14, 2022Updated 4 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- A library for speech data augmentation in time-domain☆688Aug 30, 2021Updated 4 years ago
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆654Apr 5, 2022Updated 4 years ago
- Python library for handling audio datasets.☆139Jul 6, 2023Updated 2 years ago
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…☆213Aug 7, 2025Updated 10 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- FSA/FST algorithms, differentiable, with PyTorch compatibility.☆1,342May 20, 2026Updated 2 weeks ago
- ☆22Jul 22, 2022Updated 3 years ago
- Small language toolkit for creation, interpolation and pruning of ARPA language models☆92Aug 6, 2022Updated 3 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- MirasVoice is a data set consisting speech samples from bilinguals to train neural network for optimization of speaker verification algor…☆19Mar 15, 2020Updated 6 years ago
- ABX and kaldi experiments on speech corpora made easy☆33Oct 7, 2024Updated last year
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆55Jan 2, 2020Updated 6 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Painless Wiener filters for audio separation☆191May 4, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,279Apr 13, 2026Updated last month
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆2,881Updated this week
- The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.☆329Mar 5, 2022Updated 4 years ago
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.☆869Jun 9, 2021Updated 5 years ago
- A platform for the collaborative creation of open audio collections labeled by humans and based on Freesound content.☆144May 18, 2026Updated 3 weeks ago
- Paper: https://arxiv.org/abs/1702.02285☆64Dec 19, 2018Updated 7 years ago
- Voice Activity Detector in Python☆481Nov 17, 2020Updated 5 years ago