drscotthawley / audio-classifier-keras-cnn
Audio Classifier in Keras using Convolutional Neural Network
☆160Updated 5 years ago
Alternatives and similar repositories for audio-classifier-keras-cnn:
Users that are interested in audio-classifier-keras-cnn are comparing it to the libraries listed below
- Deep Learning experiments for audio classification☆149Updated 7 years ago
- A multi-channel neural network audio classifier using Keras☆270Updated 3 years ago
- Transfer learning for music classification and regression tasks☆257Updated 5 years ago
- A convolutional neural network that classifies sounds☆160Updated 8 years ago
- A library for augmenting annotated audio data☆233Updated 3 years ago
- Environmental Sound Classification with Convolutional Neural Networks - paper replication data☆75Updated 7 years ago
- Tensorflow Implementation of Convolutional Recurrent Neural Networks for Music Genre Classification☆55Updated 8 years ago
- 8th place solution (on Kaggle) to the Freesound General-Purpose Audio Tagging Challenge (DCASE 2018 - Task 2)☆113Updated 4 years ago
- Tensorflow - Very Deep Convolutional Neural Networks For Raw Waveforms - https://arxiv.org/pdf/1610.00087.pdf☆74Updated 4 years ago
- Tensorflow implementation of the models used in "End-to-end learning for music audio tagging at scale"☆150Updated 5 years ago
- Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).☆130Updated 4 years ago
- Keras Interface for Kaldi ASR☆121Updated 7 years ago
- Spectrograms, MFCCs, and Inversion Demo in a jupyter notebook☆166Updated 5 years ago
- Trims .wav audio files to the loudest section of a given length☆95Updated 7 years ago
- A program for automatic speaker identification using deep learning techniques.☆84Updated 8 years ago
- CTC for emotion recognition☆60Updated 7 years ago
- ESC: Dataset for Environmental Sound Classification - paper replication data☆77Updated 7 years ago
- Music auto-tagging models and trained weights in keras/theano☆610Updated 6 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding☆123Updated 7 years ago
- CNN based Minimal model for recognizing word