netankit / AudioMLProject3
Emotion recognition of Speaker's Speech Data. Employ speaker detection classifiers for emotion recognition, a multiclass classification problem. Emotion Classes: Happy, Sad, Neutral, Relaxed and Angry
☆16Updated 9 years ago
Alternatives and similar repositories for AudioMLProject3:
Users that are interested in AudioMLProject3 are comparing it to the libraries listed below
- An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!☆86Updated 4 years ago
- "Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015☆20Updated 8 years ago
- Denoise Speech (Enhanced Speech or Speech enhancement) by Deep Learning (Using Keras and Tensorflow)☆39Updated 7 years ago
- Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.☆35Updated 6 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- implement end-to-end asr algorithm with tensorflow☆40Updated 6 years ago
- Project to learn about speech recognition - both Speaker Diarization and other Speech Recognition applications.☆47Updated 8 years ago
- Keyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.☆29Updated 7 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆38Updated 7 years ago
- WaveNet Vocoder Samples☆23Updated 5 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 6 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 6 years ago
- Google Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab☆46Updated 7 years ago
- System for identifying speaker from given speech signal using MFCC,LPC features and Gaussian Mixture Models☆21Updated 7 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow models☆39Updated 7 months ago
- Supplementary information and code for INTERSPEECH 2018 paper: Singing voice phoneme segmentation by hierarchically inferring syllable an…☆46Updated 6 years ago
- Classification of environmental sounds using first order statistics and GLCM (Gray-Level Co-Occurrence Matrix ) features of a spectrogram…☆24Updated 4 years ago
- Code for the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition.☆21Updated 8 years ago
- Evaluation of the classification performance (Speech, Music, and Noise) of 1D (WaveNet) and 2D (MobileNet) CNN and RNN (GRU) on the MUSAN…☆14Updated 4 years ago
- ☆15Updated 5 years ago
- Use ctc to do chinese speech recognition by keras / 通过keras和ctc实现中文语音识别☆43Updated 6 years ago
- A deep learning solution to the Query By Singing/Humming (QBSH) problem in Music Information Retrieval (MIR).☆15Updated 7 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆38Updated 7 years ago
- CTC for emotion recognition☆60Updated 7 years ago
- Tensorflow Implementation of WaveGlow☆37Updated 4 years ago
- Mapping features using Deep Neural Networks (DNNs) with application to Voice Conversion (VC). The implementations are on top of Theano Py…☆33Updated 6 years ago
- ☆15Updated 5 years ago
- Scene Classification using Audio in the nearby Environment.☆19Updated 5 years ago
- A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popu…☆19Updated 7 years ago
- A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Waven…☆52Updated 6 years ago