sid0710 / audio_data_augmentation
☆26Updated 7 years ago
Related projects: ⓘ
- ☆27Updated 6 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆93Updated 4 years ago
- Voxceleb1 i-vector based speaker recognition system☆41Updated 6 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Updated 4 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆37Updated 6 years ago
- "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation☆19Updated 5 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆62Updated 5 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆58Updated 4 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Updated 6 years ago
- DCASE 2016 Baseline system, python implementation☆51Updated 7 years ago
- Training General-Purpose Audio Tagging Networks with Noisy Labels and Iterative Self-Verification☆29Updated 5 years ago
- Speaker embedding(verification and recognition) using Tensorflow with Kaldi☆41Updated 7 years ago
- DCASE 2017 Baseline system☆82Updated 4 years ago
- A python package that make tensorflow be able to read "Kaldi" scp/ark in an elegant way. May kaldi user happy to enter tensorflow world.☆40Updated 5 years ago
- ☆50Updated this week
- Bidirectional dynamic RNN + CTC for phoneme recognition☆44Updated 4 years ago
- a python library for different types of vocoders like LPC, MCEP, PSOLA, etc.☆35Updated 9 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Updated 6 years ago
- Sound augmentation using Large-scale audio dataset (Audioset)☆44Updated 3 years ago
- Tensor2tensor experiment with SpecAugment☆47Updated 5 years ago
- ☆37Updated 7 years ago
- This repository is an extension of GAN based speech enhancement called SEGAN, and we present two modifications to make model training mor…☆36Updated last year
- ☆23Updated this week
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆49Updated 6 years ago
- Cochlear.ai submission for dcase2018 task2☆17Updated 6 years ago
- This is a project on working/resolving the speech separation problem using supervised learning on various training targets, building mach…☆34Updated 7 years ago
- Task 4 Large-scale weakly supervised sound event detection for smart cars☆62Updated 2 years ago
- ☆59Updated 3 years ago
- ☆98Updated 6 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 7 years ago