Audio data augmentation examples
☆34May 27, 2018Updated 7 years ago
Alternatives and similar repositories for audio-data-augmentation
Users that are interested in audio-data-augmentation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆26Sep 14, 2017Updated 8 years ago
- Recurrent neural network for audio noise reduction☆12Aug 18, 2022Updated 3 years ago
- Develop speaker recognition model based on i-vector using TIMIT database☆16Jul 4, 2019Updated 6 years ago
- Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"☆11Mar 24, 2023Updated 3 years ago
- Translation and draft of Machine Learning Yearning for chapter 1-22.该书1-22章的翻译及原稿。☆10Aug 1, 2018Updated 7 years ago
- Kaldi extended by Kaituo XU with new features in nnet1.☆12Dec 16, 2018Updated 7 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Sep 13, 2023Updated 2 years ago
- ☆26Dec 3, 2018Updated 7 years ago
- <파이토치 첫걸음> 리포지토리☆17May 9, 2019Updated 6 years ago
- DeepCU: Integrating Both Common and Unique Latent Information for Multimodal Sentiment Analysis, IJCAI-19☆19Nov 21, 2019Updated 6 years ago
- Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"☆99Jul 11, 2019Updated 6 years ago
- Keras + pyTorch implimentation of "Deep Learning & 3D Convolutional Neural Networks for Speaker Verification"☆29Jan 23, 2019Updated 7 years ago
- ☆18May 7, 2020Updated 5 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- These are my solutions to all six assignments of tensorflow tutorial in Udacity, covering CNN, RNN, Regularization (L2 and dropout), Embe…☆10Dec 16, 2016Updated 9 years ago
- Scene Classification using Audio in the nearby Environment.☆19Sep 4, 2019Updated 6 years ago
- Cochlear.ai submission for dcase2018 task2☆15Sep 14, 2018Updated 7 years ago
- Spectrogram is selected as preprocessing feature of audio clips and a feature representation method based on deep residual network (Spec-…☆26Sep 13, 2020Updated 5 years ago
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Dec 12, 2019Updated 6 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆39Dec 30, 2017Updated 8 years ago
- Denoise Speech (Enhanced Speech or Speech enhancement) by Deep Learning (Using Keras and Tensorflow)☆39Mar 21, 2018Updated 8 years ago
- Surrey CVSSP DCASE 2018 Task 2 system☆20Dec 26, 2022Updated 3 years ago
- Masked ConditionaL Neural Networks☆15Jul 6, 2023Updated 2 years ago
- It contains Data Augmentaion, Strided convolution, Batch Normalization, Leaky Relu, Global Average pooling, L2 Regularization, learning …☆12Jun 3, 2018Updated 7 years ago
- A Pytorch implementation of triplet loss on VoxCeleb1☆12Oct 16, 2019Updated 6 years ago
- Text-Independent Speaker Recognition Using Gaussian Mixture Models☆12Jul 1, 2015Updated 10 years ago
- A machine learning application for emotion recognition from speech☆136Feb 6, 2018Updated 8 years ago
- JAMS annotation files for the original and augmented UrbanSound8K dataset☆35Jan 31, 2018Updated 8 years ago
- Detecting emotions using MFCC features of human speech using Deep Learning☆133Dec 2, 2020Updated 5 years ago
- TensorFlow,DCGAN,VAE,LSTM,CNN,Acoustic Scene Classification☆11Jun 5, 2019Updated 6 years ago
- DCASE2016 TASK1 Scene Classification☆12May 2, 2017Updated 8 years ago
- Urban Sound Classification: With Random Forest, SVM, DNN, RNN, and CNN Classifiers☆55Jun 11, 2017Updated 8 years ago
- Matlab implementation of the paper Noise Spectrum Estimation in Adverse Environments: Improved Minima Controlled Recursive Averaging☆75Aug 1, 2017Updated 8 years ago
- Kaldi based speaker verification☆47Jan 26, 2018Updated 8 years ago
- Sound source localization using SRP-PHAT☆25Feb 17, 2019Updated 7 years ago
- Contains code for a voting classifier that is part of an ensemble learning model for tweet classification (which includes an LSTM, a baye…☆23May 8, 2018Updated 7 years ago
- TensorFlow implementation of "Multimodal Speech Emotion Recognition using Audio and Text," IEEE SLT-18☆297Jun 17, 2024Updated last year
- This file is an implementation of the algorithm proposed in paper 'Phase-Based Dual-Microphone Robust Speech Enhancement'.☆18Aug 22, 2018Updated 7 years ago
- 2018年7⽉30⽇-8⽉13⽇持续2周的好未来AI训练营中语⾳情感识别营的项目报告☆33Dec 28, 2018Updated 7 years ago