Audio data augmentation examples
☆34May 27, 2018Updated 8 years ago
Alternatives and similar repositories for audio-data-augmentation
Users that are interested in audio-data-augmentation are comparing it to the libraries listed below.
- ☆26Sep 14, 2017Updated 8 years ago
- Python3 implementation of the normalized and unnormalized spectral clustering algorithms☆12Jul 3, 2019Updated 7 years ago
- Recurrent neural network for audio noise reduction☆12Aug 18, 2022Updated 3 years ago
- Develop speaker recognition model based on i-vector using TIMIT database☆16Jul 4, 2019Updated 7 years ago
- TensorFlow and Kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"☆11Mar 24, 2023Updated 3 years ago
- Translation and draft of Machine Learning Yearning, chapters 1-22.☆10Aug 1, 2018Updated 7 years ago
- Kaldi extended by Kaituo XU with new features in nnet1.☆12Dec 16, 2018Updated 7 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Sep 13, 2023Updated 2 years ago
- ☆26Dec 3, 2018Updated 7 years ago
- Code for https://arxiv.org/abs/1712.00254☆17Dec 6, 2017Updated 8 years ago
- Learning embeddings for laughter categorization☆34Nov 3, 2018Updated 7 years ago
- Repository for the book "First Steps in PyTorch" (파이토치 첫걸음)☆17May 9, 2019Updated 7 years ago
- Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"☆99Jul 11, 2019Updated 6 years ago
- DeepCU: Integrating Both Common and Unique Latent Information for Multimodal Sentiment Analysis, IJCAI-19☆19Nov 21, 2019Updated 6 years ago
- Keras + PyTorch implementation of "Deep Learning & 3D Convolutional Neural Networks for Speaker Verification"☆29Jan 23, 2019Updated 7 years ago
- ☆18May 7, 2020Updated 6 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Cochlear.ai submission for dcase2018 task2☆15Sep 14, 2018Updated 7 years ago
- Spectrogram is selected as preprocessing feature of audio clips and a feature representation method based on deep residual network (Spec-…☆27Sep 13, 2020Updated 5 years ago
- An open-source tool for automatic speech recognition ASR quality estimation.☆24Dec 12, 2019Updated 6 years ago
- The details that matter: Frequency resolution of spectrograms in acoustic scene classification - paper replication data☆39Dec 30, 2017Updated 8 years ago
- Voice Activity Detection: In this first assignment, we will create a dataset that simulates speech in every-day scenarios. We train a cla…☆18May 3, 2015Updated 11 years ago
- Surrey CVSSP DCASE 2018 Task 2 system☆20Dec 26, 2022Updated 3 years ago
- Denoise Speech (Enhanced Speech or Speech enhancement) by Deep Learning (Using Keras and TensorFlow)☆38Mar 21, 2018Updated 8 years ago
- This plugin disables automatic screen off and prevents the screen from turning off.☆12May 16, 2026Updated last month
- Masked ConditionaL Neural Networks☆15Jul 6, 2023Updated 2 years ago
- A Java toolkit to generate multi-font Arabic text images☆11Sep 2, 2021Updated 4 years ago
- It contains Data Augmentation, Strided convolution, Batch Normalization, Leaky ReLU, Global Average pooling, L2 Regularization, learning …☆12Jun 3, 2018Updated 8 years ago
- A PyTorch implementation of triplet loss on VoxCeleb1☆12Oct 16, 2019Updated 6 years ago
- ☆11Dec 22, 2020Updated 5 years ago
- Text-Independent Speaker Recognition Using Gaussian Mixture Models☆12Jul 1, 2015Updated 11 years ago
- A machine learning application for emotion recognition from speech☆137Feb 6, 2018Updated 8 years ago
- Detecting emotions using MFCC features of human speech using Deep Learning☆132Dec 2, 2020Updated 5 years ago
- This repository created for the NHN ASR hackathon competition.☆11Sep 20, 2023Updated 2 years ago
- Coptic NLP pipeline page and utilities☆17Feb 11, 2025Updated last year
- DCASE2016 TASK1 Scene Classification☆12May 2, 2017Updated 9 years ago
- Urban Sound Classification: With Random Forest, SVM, DNN, RNN, and CNN Classifiers☆54Jun 11, 2017Updated 9 years ago
- Kaldi based speaker verification☆47Jan 26, 2018Updated 8 years ago
- Matlab implementation of the paper Noise Spectrum Estimation in Adverse Environments: Improved Minima Controlled Recursive Averaging☆75Aug 1, 2017Updated 8 years ago