harshel / AUDIO-PREOCESSING-AND-SPEECH-CLASSIFICATION
This repository consists of the IPython Notebook for the work related to audio processing and implementing convolution neural networks for the speech classification. The dataset use for this task belongs to google brain when they hosted the competition on Kaggle. The name of the challenge was TensorFlow Speech Recognition Challenge.
☆13Updated 5 years ago
Alternatives and similar repositories for AUDIO-PREOCESSING-AND-SPEECH-CLASSIFICATION:
Users that are interested in AUDIO-PREOCESSING-AND-SPEECH-CLASSIFICATION are comparing it to the libraries listed below
- ☆46Updated 6 years ago
- Classifying Audio to Emotion☆28Updated 5 years ago
- Identify the emotion of multiple speakers in an Audio Segment☆166Updated last year
- Collection of research papers on cough classification☆37Updated 4 years ago
- ☆90Updated 2 years ago
- Library of TensorFlow layers for audio data processing and data augmentation☆19Updated 2 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆80Updated 7 months ago
- A collection of Audio and Speech pre-trained models.☆183Updated 4 years ago
- Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials.☆170Updated 10 months ago
- Classify daily life events using audio data.☆50Updated 4 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆91Updated 3 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆86Updated 2 years ago
- Data repository of Project Coswara☆182Updated last year
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆67Updated 4 years ago
- Sound Classification using Neural Networks☆49Updated 2 years ago
- This project is about performing Speaker diarization for Hindi Language.☆47Updated 3 years ago
- Urban sound source tagging from an aggregation of four second noisy audio clips via 1D and 2D CNN (Xception)☆58Updated last year
- Contains links to publicly available datasets for modeling health outcomes using speech and language.☆117Updated 7 months ago
- Augmented Audio Data Generator for 1D-Convolutional Neural Networks☆49Updated 3 years ago
- Audio preprocessing framework for Deep Learning audio applications☆124Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆64Updated 3 years ago
- List of articles related to deep learning applied to music☆93Updated 5 years ago
- Detecting emotions using MFCC features of human speech using Deep Learning☆124Updated 4 years ago
- Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech☆91Updated last year
- A didactic toolkit to rapidly prototype audio classifiers with pre-trained Tensorflow models and Scikit-learn☆142Updated 2 years ago
- Identifying people from small audio fragments☆170Updated 4 years ago
- An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, …☆75Updated 4 years ago
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binder☆127Updated 3 years ago
- End-to-End Speech Recognition☆10Updated 3 years ago