A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.
☆48Oct 15, 2021Updated 4 years ago
Alternatives and similar repositories for data_augmentation_for_asr
Users that are interested in data_augmentation_for_asr are comparing it to the libraries listed below
Sorting:
- Benchmarking different VAD models on AVA-Speech dataset☆18May 21, 2023Updated 2 years ago
- A lightweight audio codec based on a single quantizer☆33Sep 4, 2025Updated 6 months ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 2 years ago
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 8 months ago
- Implementation of RIPT in golang☆11May 3, 2020Updated 5 years ago
- SoTA open-source TTS☆23Jun 17, 2025Updated 8 months ago
- Kamailio in Kubernetes configuration manager☆14May 2, 2019Updated 6 years ago
- WebRTC SIP client on golang for FreeSwitch☆11Aug 2, 2021Updated 4 years ago
- Reproduction of a paper"Small-footprint keyword spotting using deep neural networks"☆12Mar 11, 2019Updated 6 years ago
- English to French and Chinese to French .json dictionaries for Synthesizer V☆12Feb 1, 2023Updated 3 years ago
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆23Aug 14, 2025Updated 6 months ago
- VoxLingua107 recipe for SpeechBrain☆13Jul 3, 2021Updated 4 years ago
- ☆17Apr 3, 2022Updated 3 years ago
- ☆15Aug 25, 2022Updated 3 years ago
- Support material and source code for the model described in : "A Recurrent Encoder-Decoder Approach With Skip-Filtering Connections For M…☆13Sep 19, 2017Updated 8 years ago
- Unofficial Tensorflow/Keras implementation of Google AI VoiceFilter☆16Mar 25, 2023Updated 2 years ago
- Implementation of Sheffield entry for Clarity enhancement challenge.☆18Apr 19, 2022Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆20Feb 9, 2025Updated last year
- Personalized Item Exploration Processes for Recommendation☆15Sep 19, 2019Updated 6 years ago
- Neural network density models for speech separation.☆20Nov 26, 2020Updated 5 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- Personalized AEC☆19Nov 3, 2022Updated 3 years ago
- Improved Speech Enhancement GANs☆12Jun 24, 2020Updated 5 years ago
- Audio captioning recipe☆51Oct 23, 2025Updated 4 months ago
- Improving Recording Device Generalization using Impulse Response Augmentation☆20Apr 24, 2025Updated 10 months ago
- A library for speech data augmentation in time-domain☆683Aug 30, 2021Updated 4 years ago
- Real-Time De-noising and De-reverbing with Tiny Recurrent UNet☆55Jun 7, 2023Updated 2 years ago
- This in an implementation of NSNet in PyTorch and PyTorch Lightning. NSNet is a recurrent neural network for single channel speech enhanc…☆40Aug 20, 2020Updated 5 years ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆51May 1, 2025Updated 10 months ago
- SIPCheck is a tool that watch the authentication of users of Asterisk and bans automatically if some user (or bot) try to register o make…☆25Mar 14, 2020Updated 5 years ago
- RES via complex-valued DNN☆25Sep 3, 2021Updated 4 years ago
- Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini☆19Oct 21, 2025Updated 4 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆177Apr 15, 2025Updated 10 months ago
- Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.☆53Apr 16, 2025Updated 10 months ago
- ☆23May 15, 2023Updated 2 years ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆100May 24, 2023Updated 2 years ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated last year
- ☆25Feb 28, 2023Updated 3 years ago