wikke / AudioRecognition
Google Speech Command Dataset Classification Neural Network, CNN, RNN
☆25 Updated 8 years ago
Alternatives and similar repositories for AudioRecognition
Users who are interested in AudioRecognition are comparing it to the libraries listed below.
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data ☆39 Updated 7 years ago
- ☆17 Updated 8 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility. ☆32 Updated 2 years ago
- "Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015 ☆21 Updated 9 years ago
- Training General-Purpose Audio Tagging Networks with Noisy Labels and Iterative Self-Verification ☆29 Updated 6 years ago
- Audio data augmentation examples ☆34 Updated 7 years ago
- Sound augmentation using Large-scale audio dataset (Audioset) ☆45 Updated 4 years ago
- A deep learning solution to the Query By Singing/Humming (QBSH) problem in Music Information Retrieval (MIR). ☆15 Updated 8 years ago
- Surrey CVSSP DCASE 2018 Task 2 system ☆20 Updated 2 years ago
- Convolutional neural networks for sound classification ☆20 Updated 7 years ago
- Cochlear.ai submission for dcase2018 task2 ☆15 Updated 7 years ago
- THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is c… ☆34 Updated 7 years ago
- ☆27 Updated 7 years ago
- Urban Sound Classification: With Random Forest, SVM, DNN, RNN, and CNN Classifiers ☆56 Updated 8 years ago
- Dialect identification using Siamese network ☆15 Updated 8 years ago
- Training neural audio classifiers with few data − https://arxiv.org/abs/1810.10274 ☆60 Updated 6 years ago
- Implemented 3 neural network architectures: 1) Combination of RNN LSTM nodes and CNN, 2) CNN with residual blocks similar to ResNet, 3) D… ☆25 Updated 7 years ago
- solutions for https://www.kaggle.com/c/tensorflow-speech-recognition-challenge ☆31 Updated 7 years ago
- Evaluation of the classification performance (Speech, Music, and Noise) of 1D (WaveNet) and 2D (MobileNet) CNN and RNN (GRU) on the MUSAN… ☆14 Updated 5 years ago
- Code for our paper "Acoustic Features Fusion using Attentive Multi-channel Deep Architecture" in Keras and tensorflow ☆26 Updated 7 years ago
- These are the results for VoiceGAN voice transformation. You can hear the audios which are in folder A-AB-ABA/B-BA-BAB ☆50 Updated 6 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network. ☆69 Updated 3 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification ☆10 Updated 7 years ago
- Implementation and reviews of Audio & Computer vision related papers in python using keras and tensorflow. ☆40 Updated 7 years ago
- Code and instruction on replicating the experiments done in paper: Unified Hypersphere Embedding for Speaker Recognition ☆32 Updated 6 years ago
- Stochastic Adaptive Neural Architecture Search ☆65 Updated 7 years ago
- Code for ICASSP 2019 paper ☆18 Updated 7 years ago
- Tensorflow implementation of "Speaker-independent Speech Separation with Deep Attractor Network" ☆90 Updated 4 years ago
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!! ☆85 Updated 5 years ago
- DCASE2016 TASK1 Scene Classification ☆12 Updated 8 years ago