wikke / AudioRecognition
Google Speech Command Dataset Classification Neural Network, CNN, RNN
☆24Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for AudioRecognition
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆38Updated 6 years ago
- Audio data augmentation examples☆35Updated 6 years ago
- Audio command recognition by DTW and classification☆7Updated 3 years ago
- Training General-Purpose Audio Tagging Networks with Noisy Labels and Iterative Self-Verification☆29Updated 5 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- "Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015☆20Updated 7 years ago
- Bag-of-Features Acoustic Event Detection☆14Updated 8 years ago
- Random regression forests for audio event detection☆9Updated 7 years ago
- Keyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.☆28Updated 6 years ago
- Convolutional neural networks for sound classification☆20Updated 6 years ago
- Urban Sound Classification: With Random Forest, SVM, DNN, RNN, and CNN Classifiers☆52Updated 7 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 5 years ago
- Surrey CVSSP DCASE 2018 Task 2 system☆19Updated last year
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274☆60Updated 5 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Updated last year
- ☆16Updated 5 years ago
- Denoise Speech (Enhanced Speech or Speech enhancement) by Deep Learning (Using Keras and Tensorflow)☆39Updated 6 years ago
- solutions for https://www.kaggle.com/c/tensorflow-speech-recognition-challenge☆32Updated 6 years ago
- Real-time speech enhancement based on spectral subtraction☆14Updated 6 years ago
- Implemented 3 neural network architectures: 1) Combination of RNN LSTM nodes and CNN, 2) CNN with residual blocks similar to ResNet, 3) D…☆25Updated 6 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 6 years ago
- Train a 4-layer Convolutional Neural Network to detect trigger word☆52Updated 6 years ago
- A python library to analyze tuning and intonation related stuff in melodies across various music traditions in the world.☆18Updated 2 months ago
- A program for automatic speaker identification using deep learning techniques.☆84Updated 7 years ago
- Classification of environmental sounds using first order statistics and GLCM (Gray-Level Co-Occurrence Matrix ) features of a spectrogram…☆24Updated 4 years ago
- ☆26Updated 6 years ago
- Tensorflow implementation of "Speaker-independent Speech Separation with Deep Attractor Network"☆89Updated 3 years ago
- System for Emotion Detection in given speech data using joint modelling of hand crafted prosody rich features , MFCC features and LSTM ba…☆10Updated 7 years ago
- This is part of code of a research on speech synthesizing for a low-resourced language: Gan, a Chinese dialect spoken primarily in Jiangx…☆17Updated 8 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆47Updated 7 years ago