A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.
☆48Oct 15, 2021Updated 4 years ago
Alternatives and similar repositories for data_augmentation_for_asr
Users that are interested in data_augmentation_for_asr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Improving Recording Device Generalization using Impulse Response Augmentation☆21Apr 24, 2025Updated last year
- A lightweight audio codec based on a single quantizer☆34Sep 4, 2025Updated 9 months ago
- Benchmarking different VAD models on AVA-Speech dataset☆18May 21, 2023Updated 3 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 3 years ago
- golang vad (voice activity detection) library based on webrtc☆12Dec 13, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 11 months ago
- SoTA open-source TTS☆23Jun 17, 2025Updated last year
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated 2 years ago
- Implementation of Sheffield entry for Clarity enhancement challenge.☆18Apr 19, 2022Updated 4 years ago
- Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech☆20Feb 9, 2025Updated last year
- Personalized Item Exploration Processes for Recommendation☆15Sep 19, 2019Updated 6 years ago
- A library for speech data augmentation in time-domain☆689Aug 30, 2021Updated 4 years ago
- 🗣️ Convert between phonetic alphabets☆11Feb 7, 2022Updated 4 years ago
- The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.☆102May 24, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆21Sep 24, 2018Updated 7 years ago
- Real-Time De-noising and De-reverbing with Tiny Recurrent UNet☆56Jun 7, 2023Updated 3 years ago
- ☆15Aug 25, 2022Updated 3 years ago
- Discogs-VI dataset and code☆21Dec 13, 2024Updated last year
- Variable Bitrate Residual Vector Quantization for Audio Coding☆53May 1, 2025Updated last year
- [ICON 2020] TensorFlow Code for "End-to-End Automatic Speech Recognition System for Gujarati"☆13Jul 26, 2021Updated 4 years ago
- Personalized AEC☆19Nov 3, 2022Updated 3 years ago
- ☆23May 15, 2023Updated 3 years ago
- A time delay estimation method for event-based time-series data. Time delay estimation is also known as the correction of time offsets an…☆16Dec 3, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆17Apr 3, 2022Updated 4 years ago
- PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper☆12Mar 4, 2022Updated 4 years ago
- Unofficial Tensorflow/Keras implementation of Google AI VoiceFilter☆16Mar 25, 2023Updated 3 years ago
- VoxLingua107 recipe for SpeechBrain☆13Jul 3, 2021Updated 4 years ago
- Neural network density models for speech separation.☆20Nov 26, 2020Updated 5 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Jun 25, 2021Updated 4 years ago
- An unofficial implementation of Vector Quantization Voice Conversion (VQVC).☆29Apr 12, 2021Updated 5 years ago
- Real-Time ASR with CNN-BiLSTM: End-to-End Live Streaming Using PyTorch Lightning⚡☆11Jan 23, 2025Updated last year
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15May 19, 2020Updated 6 years ago
- Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem☆99May 30, 2025Updated last year
- [InterSpeech'2023] "Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion"☆14Mar 14, 2024Updated 2 years ago
- This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection fr…☆12Dec 19, 2025Updated 6 months ago
- One command to build TLG.fst for WeNet.☆30Oct 11, 2022Updated 3 years ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆65Aug 29, 2022Updated 3 years ago
- ☆10Oct 25, 2019Updated 6 years ago