harshel / AUDIO-PREOCESSING-AND-SPEECH-CLASSIFICATION
This repository consists of the IPython Notebook for the work related to audio processing and implementing convolution neural networks for the speech classification. The dataset use for this task belongs to google brain when they hosted the competition on Kaggle. The name of the challenge was TensorFlow Speech Recognition Challenge.
☆13Updated 6 years ago
Alternatives and similar repositories for AUDIO-PREOCESSING-AND-SPEECH-CLASSIFICATION:
Users that are interested in AUDIO-PREOCESSING-AND-SPEECH-CLASSIFICATION are comparing it to the libraries listed below
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆66Updated 4 years ago
- ☆46Updated 7 years ago
- Sound Classification using Neural Networks☆49Updated 2 years ago
- Augmented Audio Data Generator for 1D-Convolutional Neural Networks☆49Updated 3 years ago
- ☆90Updated 2 years ago
- Library of TensorFlow layers for audio data processing and data augmentation☆19Updated 2 years ago
- Urban sounds classification with Covnolutional Neural Networks☆36Updated 5 years ago
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆24Updated 4 years ago
- Code examples for the book "Deep Learning for Audio: A Comprehensive Journey From Theory to Deployment"☆18Updated 4 years ago
- This project is about performing Speaker diarization for Hindi Language.☆48Updated 3 years ago
- Contains links to publicly available datasets for modeling health outcomes using speech and language.☆117Updated 8 months ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆86Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆64Updated 3 years ago
- An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, …☆75Updated 4 years ago
- Urban sound source tagging from an aggregation of four second noisy audio clips via 1D and 2D CNN (Xception)☆58Updated last year
- speaker_diarization done on toy dataset and tested on timit dataset☆8Updated 3 years ago
- Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials.☆170Updated 11 months ago
- Udacity 2018 Machine Learning Nanodegree Capstone project☆146Updated 6 years ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.☆17Updated 3 years ago
- ♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).☆81Updated 8 months ago
- Repository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel.☆37Updated 3 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆91Updated 3 years ago
- List of articles related to deep learning applied to music☆94Updated 5 years ago
- Speaker Diarization using GRU in PyTorch☆11Updated 4 years ago
- Urban Sound Classification : striving towards a fair comparison☆17Updated 4 years ago
- Time series course Fall 2019 project☆54Updated 4 years ago
- Text to Speech for Indic languages☆50Updated 2 years ago
- We'll look into audio categorization using deep learning principles like Artificial Neural Networks (ANN), 1D Convolutional Neural Networ…☆41Updated 2 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆72Updated 2 years ago
- Classifying 10 different categories of Sound using Deep Learning.☆25Updated 6 years ago