harshel / AUDIO-PREOCESSING-AND-SPEECH-CLASSIFICATION
This repository contains the IPython notebook for work on audio processing and on implementing convolutional neural networks for speech classification. The dataset used for this task was released by Google Brain for a competition it hosted on Kaggle, the TensorFlow Speech Recognition Challenge.
☆13 Updated 5 years ago
Related projects:
- Library of TensorFlow layers for audio data processing and data augmentation ☆20 Updated 2 years ago
- This project is about performing Speaker diarization for Hindi Language. ☆44 Updated 3 years ago
- Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials. ☆164 Updated 6 months ago
- ☆45 Updated 6 years ago
- Collection of research papers on cough classification ☆35 Updated 4 years ago
- Identify the emotion of multiple speakers in an Audio Segment ☆157 Updated last year
- ☆90 Updated last year
- Speaker Diarization using GRU in PyTorch ☆11 Updated 4 years ago
- Classifying Audio to Emotion ☆27 Updated 4 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf ☆57 Updated 3 years ago
- Sound Classification using Neural Networks ☆47 Updated last year
- Speech to text transcription using RNN (Listen, Attend and Spell). ☆11 Updated 5 years ago
- Time series course Fall 2019 project ☆52 Updated 4 years ago
- End-to-End Speech Recognition ☆8 Updated 3 years ago
- Text to Speech for Indic languages ☆49 Updated 2 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models. ☆85 Updated last year
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id… ☆64 Updated 3 years ago
- ☆17 Updated 4 years ago
- Urban sound source tagging from an aggregation of four second noisy audio clips via 1D and 2D CNN (Xception) ☆58 Updated last year
- ☆49 Updated this week
- Udacity 2018 Machine Learning Nanodegree Capstone project ☆146 Updated 5 years ago
- open-source audio datasets ☆141 Updated last year
- Contains links to publicly available datasets for modeling health outcomes using speech and language. ☆112 Updated 3 months ago
- Using Convolutional Neural Networks in speech emotion recognition on the RAVDESS Audio Dataset. ☆134 Updated 3 years ago
- ☆16 Updated this week
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline ☆32 Updated last year
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy). ☆77 Updated 3 months ago
- ☆21 Updated 4 years ago
- ☆22 Updated 3 years ago
- Repository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel. ☆38 Updated 2 years ago