adhishthite / sound-mnist
A Convolutional Neural Network to identify spoken digits.
☆50Updated 3 years ago
Alternatives and similar repositories for sound-mnist:
Users that are interested in sound-mnist are comparing it to the libraries listed below
- Augmented Audio Data Generator for 1D-Convolutional Neural Networks☆49Updated 3 years ago
- ESC: Dataset for Environmental Sound Classification - paper replication data☆79Updated 7 years ago
- A walkthrough of how to make spectrograms in python that are customized for human speech research.☆39Updated last year
- Tensorflow - Very Deep Convolutional Neural Networks For Raw Waveforms - https://arxiv.org/pdf/1610.00087.pdf☆74Updated 4 years ago
- A neural attention model for speech command recognition☆185Updated 2 years ago
- Identifying people from small audio fragments☆170Updated 5 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆38Updated 7 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆73Updated 3 years ago
- Audio transforms for FastAI☆194Updated 5 years ago
- End-to-End Speech Recognition using Neural Networks.☆35Updated 8 months ago
- CNN 1D vs 2D audio classification☆104Updated 6 years ago
- Music genre classification using Convolutional Neural Networks on Spectrograms in PyTorch☆39Updated 5 years ago
- Implementation of the Griffin and Lim algorithm to recover an audio signal from a magnitude-only spectrogram.☆174Updated 7 years ago
- Spectrograms, MFCCs, and Inversion Demo in a jupyter notebook☆166Updated 5 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆239Updated 4 years ago
- python3 version of pyaudioanalysis☆19Updated 6 years ago
- Built a deep neural network that functions as part of an end-to-end automatic speech recognition (ASR) pipeline.☆48Updated 6 years ago
- It uses GMM to train a gender detector model. The testing has been done on subset of Google's AudioSet corpus.☆19Updated 7 years ago
- Gammatone-based spectrograms, using gammatone filterbanks or Fourier transform weightings.☆222Updated last year
- 🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)☆224Updated 4 years ago
- Text Independent Speaker Verification Using GE2E Loss☆84Updated 6 years ago
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf☆31Updated 3 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- Include some core functions and model to handle speech separation☆155Updated 3 years ago
- Deep Learning experiments for audio classification☆149Updated 7 years ago
- A simple audio feature extraction library☆80Updated 5 years ago
- Authors' implementation of DeepSpeech Distances.☆129Updated 5 years ago
- ☆18Updated 4 years ago
- ☆90Updated 2 years ago
- keras project for audio deep learning☆40Updated 7 years ago