Jakobovski / free-spoken-digit-dataset
A free audio dataset of spoken digits. An audio version of MNIST.
☆638Updated 10 months ago
Alternatives and similar repositories for free-spoken-digit-dataset:
Users who are interested in free-spoken-digit-dataset are comparing it to the repositories listed below.
- ☆356Updated 11 months ago
- kapre: Keras Audio Preprocessors☆927Updated last year
- An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain☆647Updated 2 years ago
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )☆536Updated 3 years ago
- The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus.☆299Updated 3 years ago
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,002Updated last month
- UrbanSound classification using Convolutional Recurrent Networks in PyTorch☆385Updated 3 years ago
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,160Updated 3 years ago
- A Python wrapper for Kaldi☆1,008Updated last month
- ESC-50: Dataset for Environmental Sound Classification☆1,482Updated 11 months ago
- Real-time Voice Activity Detection in Noisy Environments using Deep Neural Networks☆436Updated 4 years ago
- The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech proc…☆366Updated 2 months ago
- A flexible source separation library in Python☆628Updated 2 months ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.☆582Updated 3 years ago
- A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech…☆743Updated 4 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆376Updated last year
- A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.☆1,968Updated this week
- A library for speech data augmentation in time-domain☆655Updated 3 years ago
- 🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)☆224Updated 4 years ago
- CNN-based audio segmentation toolkit. Detects speech, music, noise and speaker gender. Designed for large scale gender …☆787Updated last month
- VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets☆384Updated 6 years ago
- An audio/acoustic activity detection and audio segmentation tool☆766Updated 2 months ago
- Audio processing using a PyTorch 1D convolution network☆1,053Updated last year
- 🔦 A PyTorch implementation of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆493Updated 3 years ago
- Urban sound classification using Deep Learning☆514Updated 2 years ago
- ESC: Dataset for Environmental Sound Classification - paper replication data☆78Updated 7 years ago
- A neural attention model for speech command recognition☆183Updated last year
- PyTorch implementations of neural network models for keyword spotting☆514Updated last year
- Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.☆851Updated 3 years ago
- Utterance-level Aggregation For Speaker Recognition In The Wild☆367Updated last year