Jakobovski / free-spoken-digit-dataset
A free audio dataset of spoken digits. An audio version of MNIST.
☆626Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for free-spoken-digit-dataset
- ☆348Updated 8 months ago
- kapre: Keras Audio Preprocessors☆922Updated last year
- PyTorch implementations of neural network models for keyword spotting☆513Updated last year
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,140Updated 3 years ago
- VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets☆379Updated 5 years ago
- A Convolutional Neural Network to identify spoken digits.☆50Updated 2 years ago
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆641Updated 2 years ago
- The Audio Set Ontology aims to provide a comprehensive set of categories to describe sound events.☆652Updated 6 years ago
- A PyTorch Implementation of End-to-End Models for Speech-to-Text☆754Updated last year
- A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.☆1,875Updated last week
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆963Updated 2 weeks ago
- Spectrograms, MFCCs, and Inversion Demo in a jupyter notebook☆164Updated 5 years ago
- The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.☆296Updated 2 years ago
- Audio processing by using pytorch 1D convolution network☆1,032Updated 9 months ago
- Trims .wav audio files to the loudest section of a given length☆95Updated 6 years ago
- A Python wrapper for the high-quality vocoder "World"☆725Updated last year
- A neural attention model for speech command recognition☆180Updated last year
- Deep Speaker: an End-to-End Neural Speaker Embedding System.☆906Updated 7 months ago
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆383Updated 3 years ago
- 🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)☆223Updated 4 years ago
- A flexible source separation library in Python☆622Updated last year
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis☆978Updated last year
- Real-time Voice Activity Detection in Noisy Eniviroments using Deep Neural Networks☆429Updated 4 years ago
- A Keras CTC implementation of Baidu's DeepSpeech for model experimentation☆242Updated 6 years ago
- WaveGAN: Learn to synthesize raw audio with generative adversarial networks☆1,330Updated last year
- Web application to record speech for an open data set☆421Updated 4 years ago
- A python wrapper for Speech Signal Processing Toolkit (SPTK).☆441Updated 4 months ago
- Urban sound classification using Deep Learning☆512Updated 2 years ago
- Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for bea…☆1,465Updated this week
- Front-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a …☆240Updated last year