alibugra/audio-data-augmentation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alibugra/audio-data-augmentation)

alibugra / audio-data-augmentation

Audio data augmentation examples

☆34

Alternatives and similar repositories for audio-data-augmentation

Users that are interested in audio-data-augmentation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sid0710 / audio_data_augmentation
View on GitHub
☆26Sep 14, 2017Updated 8 years ago
zhangyk8 / Spectral-Clustering
View on GitHub
Python3 implementation of the normalized and unnormalized spectral clustering algorithms
☆12Jul 3, 2019Updated 7 years ago
david8862 / rnnoise
View on GitHub
Recurrent neural network for audio noise reduction
☆12Aug 18, 2022Updated 3 years ago
luan78zaoha / kaldi-timit-sre-ivector
View on GitHub
Develop speaker recognition model based on i-vector using TIMIT database
☆16Jul 4, 2019Updated 7 years ago
CSLT-THU / IS2019-VAE
View on GitHub
Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"
☆11Mar 24, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
yucc2018 / machine-learning-yearning
View on GitHub
Translation and draft of Machine Learning Yearning for chapter 1-22.该书1-22章的翻译及原稿。
☆10Aug 1, 2018Updated 7 years ago
kaituoxu / kaldi-ktnet1
View on GitHub
Kaldi extended by Kaituo XU with new features in nnet1.
☆12Dec 16, 2018Updated 7 years ago
lkamat / Opencv
View on GitHub
Keras Functional API for multiple inputs and mixed data
☆11Feb 18, 2019Updated 7 years ago
ankitshah009 / WALNet-Weak_Label_Analysis
View on GitHub
Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.
☆32Sep 13, 2023Updated 2 years ago
sverma88 / DeepCU-IJCAI19
View on GitHub
DeepCU: Integrating Both Common and Unique Latent Information for Multimodal Sentiment Analysis, IJCAI-19
☆19Nov 21, 2019Updated 6 years ago
corticph / MSTmodel
View on GitHub
Code for https://arxiv.org/abs/1712.00254
☆18Dec 6, 2017Updated 8 years ago
ankuPRK / Emotion-Recognition-in-Hindi-Speech
View on GitHub
Classifying utterances in Hindi speech in one of the 8 emotional states (anger, fear, disgust, neutral, sad, happy, surprise, sarcastic) …
☆11Apr 28, 2016Updated 10 years ago
GreenHandLW / TF-GSC
View on GitHub
☆26Dec 3, 2018Updated 7 years ago
edufonseca / icassp19
View on GitHub
Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"
☆99Jul 11, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
imranparuk / speaker-recognition-3d-cnn
View on GitHub
Keras + pyTorch implimentation of "Deep Learning & 3D Convolutional Neural Networks for Speaker Verification"
☆29Jan 23, 2019Updated 7 years ago
lessonxmk / head_fusion
View on GitHub
☆18May 7, 2020Updated 6 years ago
leimao / Tensorflow_Assignment_Solutions
View on GitHub
These are my solutions to all six assignments of tensorflow tutorial in Udacity, covering CNN, RNN, Regularization (L2 and dropout), Embe…
☆10Dec 16, 2016Updated 9 years ago
anujdutt9 / Audio-Scene-Classification
View on GitHub
Scene Classification using Audio in the nearby Environment.
☆19Sep 4, 2019Updated 6 years ago
finejuly / dcase2018_task2_cochlearai
View on GitHub
Cochlear.ai submission for dcase2018 task2
☆15Sep 14, 2018Updated 7 years ago
embatbr / graduation-project
View on GitHub
Text-Independent Speaker Recognition Using Gaussian Mixture Models
☆12Jul 1, 2015Updated 11 years ago
LuisKay / Spec_ResNet
View on GitHub
Spectrogram is selected as preprocessing feature of audio clips and a feature representation method based on deep residual network (Spec-…
☆27Sep 13, 2020Updated 5 years ago
karolpiczak / paper-2017-DCASE
View on GitHub
The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data
☆39Dec 30, 2017Updated 8 years ago
netankit / AudioMLProject1
View on GitHub
Voice Activity Detection: In this first assignment, we will create a dataset that simulates speech in every-day scenarios. We train a cla…
☆18May 3, 2015Updated 11 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
hlt-mt / TranscRater
View on GitHub
An open-source tool for automatic speech recognition ASR quality estimation.
☆24Dec 12, 2019Updated 6 years ago
boozyguo / ClearWave
View on GitHub
Denoise Speech (Enhanced Speech or Speech enhancement) by Deep Learning (Using Keras and Tensorflow)
☆38Mar 21, 2018Updated 8 years ago
tqbl / dcase2018_task2
View on GitHub
Surrey CVSSP DCASE 2018 Task 2 system
☆20Dec 26, 2022Updated 3 years ago
craftsoft-dev / flutter_keep_screen_on
View on GitHub
This plugin disables automatic screen off and prevents the screen from turning off.
☆12May 16, 2026Updated 2 months ago
MarioRuggieri / Emotion-Recognition-from-Speech
View on GitHub
A machine learning application for emotion recognition from speech
☆137Feb 6, 2018Updated 8 years ago
fadymedhat / MCLNN-theano
View on GitHub
Masked ConditionaL Neural Networks
☆15Jul 6, 2023Updated 3 years ago
mingukkang / MNIST-Tensorflow-Code
View on GitHub
It contains Data Augmentaion, Strided convolution, Batch Normalization, Leaky Relu, Global Average pooling, L2 Regularization, learning …
☆12Jun 3, 2018Updated 8 years ago
justinsalamon / UrbanSound8K-JAMS
View on GitHub
JAMS annotation files for the original and augmented UrbanSound8K dataset
☆35Jan 31, 2018Updated 8 years ago
matln / voxceleb_triplet-loss
View on GitHub
A Pytorch implementation of triplet loss on VoxCeleb1
☆12Oct 16, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
rohitma38 / cnn-onset-detection
View on GitHub
☆11Dec 22, 2020Updated 5 years ago
JunhoKim94 / ASR_project
View on GitHub
This repository created for the NHN ASR hackathon competition.
☆11Sep 20, 2023Updated 2 years ago
deeplearningzhy / DL
View on GitHub
TensorFlow，DCGAN，VAE，LSTM，CNN，Acoustic Scene Classification
☆11Jun 5, 2019Updated 7 years ago
msfasha / TextImagesToolkit
View on GitHub
A Java toolkit to generate multi fonts Arabic text images
☆11Sep 2, 2021Updated 4 years ago
qiuqiangkong / DCASE2016_Task1
View on GitHub
DCASE2016 TASK1 Scene Classification
☆12May 2, 2017Updated 9 years ago
BenjaminDoran / Urban-Sound-Classification
View on GitHub
Urban Sound Classification: With Random Forest, SVM, DNN, RNN, and CNN Classifiers
☆55Jun 11, 2017Updated 9 years ago
david-yoon / multimodal-speech-emotion
View on GitHub
TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18
☆297Jun 17, 2024Updated 2 years ago