harshel / AUDIO-PREOCESSING-AND-SPEECH-CLASSIFICATION
This repository consists of the IPython Notebook for the work related to audio processing and implementing convolution neural networks for the speech classification. The dataset use for this task belongs to google brain when they hosted the competition on Kaggle. The name of the challenge was TensorFlow Speech Recognition Challenge.
☆13Updated 6 years ago
Alternatives and similar repositories for AUDIO-PREOCESSING-AND-SPEECH-CLASSIFICATION
Users that are interested in AUDIO-PREOCESSING-AND-SPEECH-CLASSIFICATION are comparing it to the libraries listed below
Sorting:
- Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials.☆174Updated last year
- Library of TensorFlow layers for audio data processing and data augmentation☆20Updated 3 years ago
- Identify the emotion of multiple speakers in an Audio Segment☆171Updated 2 years ago
- Collection of research papers on cough classification☆39Updated 5 years ago
- Classifying Audio to Emotion☆28Updated 5 years ago
- List of articles related to deep learning applied to music☆94Updated 5 years ago
- Augmented Audio Data Generator for 1D-Convolutional Neural Networks☆49Updated 3 years ago
- ☆45Updated 7 years ago
- ☆90Updated 2 years ago
- Using Convolutional Neural Networks in speech emotion recognition on the RAVDESS Audio Dataset.☆137Updated 4 years ago
- Different methods and techniques for features extraction from audio☆56Updated last year
- This project is about performing Speaker diarization for Hindi Language.☆49Updated 4 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆168Updated 10 months ago
- Machine Learning Sound Classifier☆135Updated 5 years ago
- open-source audio datasets☆150Updated last year
- Code examples for the book "Deep Learning for Audio: A Comprehensive Journey From Theory to Deployment"☆18Updated 5 years ago
- ☆118Updated 4 years ago
- A collection of Audio and Speech pre-trained models.☆190Updated 4 years ago
- Time series course Fall 2019 project☆54Updated 4 years ago
- Udacity 2018 Machine Learning Nanodegree Capstone project☆146Updated 6 years ago
- Data repository of Project Coswara☆187Updated last year
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆87Updated 2 years ago
- ☆29Updated 6 years ago
- Urdu Language Speech Emotional Corpus☆45Updated 6 years ago
- ☆23Updated 4 years ago
- Music genre classification using Convolutional Neural Networks on Spectrograms in PyTorch☆39Updated 5 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 4 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆72Updated 2 years ago
- An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, …☆75Updated 4 years ago
- Contains links to publicly available datasets for modeling health outcomes using speech and language.☆121Updated 11 months ago