Audio data augmentation examples
☆34May 27, 2018Updated 7 years ago
Alternatives and similar repositories for audio-data-augmentation
Users that are interested in audio-data-augmentation are comparing it to the libraries listed below.
- Python3 implementation of the normalized and unnormalized spectral clustering algorithms☆12Jul 3, 2019Updated 6 years ago
- Recurrent neural network for audio noise reduction☆12Aug 18, 2022Updated 3 years ago
- Develop speaker recognition model based on i-vector using TIMIT database☆16Jul 4, 2019Updated 6 years ago
- Kaldi extended by Kaituo XU with new features in nnet1.☆12Dec 16, 2018Updated 7 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Sep 13, 2023Updated 2 years ago
- Keras Functional API for multiple inputs and mixed data☆11Feb 18, 2019Updated 7 years ago
- ☆26Dec 3, 2018Updated 7 years ago
- Learning embeddings for laughter categorization☆34Nov 3, 2018Updated 7 years ago
- Repository for <PyTorch First Steps> (파이토치 첫걸음)☆17May 9, 2019Updated 6 years ago
- DeepCU: Integrating Both Common and Unique Latent Information for Multimodal Sentiment Analysis, IJCAI-19☆19Nov 21, 2019Updated 6 years ago
- Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"☆99Jul 11, 2019Updated 6 years ago
- Keras + PyTorch implementation of "Deep Learning & 3D Convolutional Neural Networks for Speaker Verification"☆29Jan 23, 2019Updated 7 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- These are my solutions to all six assignments of tensorflow tutorial in Udacity, covering CNN, RNN, Regularization (L2 and dropout), Embe…☆10Dec 16, 2016Updated 9 years ago
- Cochlear.ai submission for dcase2018 task2☆15Sep 14, 2018Updated 7 years ago
- Spectrogram is selected as preprocessing feature of audio clips and a feature representation method based on deep residual network (Spec-…☆27Sep 13, 2020Updated 5 years ago
- An open-source tool for automatic speech recognition (ASR) quality estimation.☆23Dec 12, 2019Updated 6 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆39Dec 30, 2017Updated 8 years ago
- Voice Activity Detection: In this first assignment, we will create a dataset that simulates speech in every-day scenarios. We train a cla…☆18May 3, 2015Updated 10 years ago
- Surrey CVSSP DCASE 2018 Task 2 system☆20Dec 26, 2022Updated 3 years ago
- Denoise Speech (Enhanced Speech or Speech enhancement) by Deep Learning (Using Keras and Tensorflow)☆38Mar 21, 2018Updated 8 years ago
- Masked ConditionaL Neural Networks☆15Jul 6, 2023Updated 2 years ago
- A Java toolkit to generate multi fonts Arabic text images☆11Sep 2, 2021Updated 4 years ago
- It contains Data Augmentation, Strided convolution, Batch Normalization, Leaky Relu, Global Average pooling, L2 Regularization, learning …☆12Jun 3, 2018Updated 7 years ago
- A Pytorch implementation of triplet loss on VoxCeleb1☆12Oct 16, 2019Updated 6 years ago
- Text-Independent Speaker Recognition Using Gaussian Mixture Models☆12Jul 1, 2015Updated 10 years ago
- A machine learning application for emotion recognition from speech☆136Feb 6, 2018Updated 8 years ago
- Detecting emotions using MFCC features of human speech using Deep Learning☆133Dec 2, 2020Updated 5 years ago
- This repository created for the NHN ASR hackathon competition.☆11Sep 20, 2023Updated 2 years ago
- TensorFlow, DCGAN, VAE, LSTM, CNN, Acoustic Scene Classification☆11Jun 5, 2019Updated 6 years ago
- Coptic NLP pipeline page and utilities☆17Feb 11, 2025Updated last year
- DCASE2016 TASK1 Scene Classification☆12May 2, 2017Updated 8 years ago
- Urban Sound Classification: With Random Forest, SVM, DNN, RNN, and CNN Classifiers☆55Jun 11, 2017Updated 8 years ago
- Kaldi based speaker verification☆47Jan 26, 2018Updated 8 years ago
- Matlab implementation of the paper Noise Spectrum Estimation in Adverse Environments: Improved Minima Controlled Recursive Averaging☆75Aug 1, 2017Updated 8 years ago
- Sound source localization using SRP-PHAT☆25Feb 17, 2019Updated 7 years ago
- Contains code for a voting classifier that is part of an ensemble learning model for tweet classification (which includes an LSTM, a baye…☆23May 8, 2018Updated 7 years ago
- An audio steganalysis method based on CNN in the time domain.☆12Feb 25, 2021Updated 5 years ago
- TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18☆297Jun 17, 2024Updated last year