Jakobovski / free-spoken-digit-dataset
A free audio dataset of spoken digits. An audio version of MNIST.
☆631Updated 8 months ago
Alternatives and similar repositories for free-spoken-digit-dataset:
Users that are interested in free-spoken-digit-dataset are comparing it to the libraries listed below
- ☆355Updated 10 months ago
- VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets☆381Updated 5 years ago
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,147Updated 3 years ago
- A neural attention model for speech command recognition☆183Updated last year
- The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.☆299Updated 2 years ago
- The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number…☆501Updated 6 months ago
- 🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆493Updated 3 years ago
- PyTorch implementations of neural network models for keyword spotting☆515Updated last year
- Voice Activity Detector in Python☆472Updated 4 years ago
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆645Updated 2 years ago
- Deep Speaker: an End-to-End Neural Speaker Embedding System.☆912Updated 9 months ago
- kapre: Keras Audio Preprocessors☆925Updated last year
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆375Updated last year
- A Python wrapper for Kaldi☆1,006Updated 5 months ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.☆580Updated 3 years ago
- Speech Enhancement Generative Adversarial Network in TensorFlow☆829Updated last year
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc…☆366Updated last month
- Gammatone-based spectrograms, using gammatone filterbanks or Fourier transform weightings.☆220Updated last year
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆535Updated 2 years ago
- Speech commands recognition with PyTorch | Kaggle 10th place solution in TensorFlow Speech Recognition Challenge☆197Updated last year
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆429Updated 4 years ago
- Speaker embedding(verification and recognition) using Pytorch☆366Updated 4 years ago
- ESC: Dataset for Environmental Sound Classification - paper replication data☆77Updated 7 years ago
- Spectrograms, MFCCs, and Inversion Demo in a jupyter notebook☆165Updated 5 years ago
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).☆1,791Updated 7 months ago
- Tensorflow implementation of "Generalized End-to-End Loss for Speaker Verification"☆362Updated 3 years ago
- Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).☆274Updated last year
- Utterance-level Aggregation For Speaker Recognition In The Wild☆367Updated last year
- An STFT/iSTFT for PyTorch.☆353Updated last year
- A list of publically available audio data that anyone can download for ASR or other speech activities☆202Updated 3 years ago