freds0 / data_augmentation_for_asr
A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.
☆39 · Updated 3 years ago
Alternatives and similar repositories for data_augmentation_for_asr:
Users who are interested in data_augmentation_for_asr are comparing it to the libraries listed below:
- Clustering-based methods for overlapping diarization ☆77 · Updated last year
- Multilingual speech aligner ☆72 · Updated last year
- ☆51 · Updated 4 months ago
- End-to-End Keyword Spotting (E2E-KWS) using a character-level LSTM ☆39 · Updated 2 years ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR ☆50 · Updated last month
- ☆64 · Updated last year
- A toolkit to calculate speech audio quality. Not affiliated with the original authors ☆51 · Updated 7 months ago
- Streaming Audiotransformers for online Audio tagging ☆43 · Updated 9 months ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by … ☆39 · Updated last year
- ☆14 · Updated 2 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021 ☆67 · Updated 3 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm. ☆68 · Updated 3 years ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆47 · Updated 8 months ago
- A list of papers for child ASR ☆38 · Updated 5 months ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning ☆86 · Updated 3 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆63 · Updated last year
- ☆30 · Updated last year
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20… ☆47 · Updated 2 months ago
- Production first, nn-based on-device signal processing toolkit. ☆64 · Updated last year
- PyTorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper ☆22 · Updated 2 years ago
- A PyTorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK ☆61 · Updated 3 years ago
- An STFT/iSTFT written up in PyTorch using 1D Convolutions ☆27 · Updated 8 months ago
- A simple package for Guided source separation (GSS) ☆117 · Updated 9 months ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆75 · Updated 3 years ago
- ☆64 · Updated 6 months ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing ☆70 · Updated 2 years ago
- [ICASSP2021] Data preparation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech… ☆55 · Updated 4 years ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain) ☆60 · Updated 2 years ago
- ☆16 · Updated 2 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert ☆39 · Updated last year