freds0 / data_augmentation_for_asr
A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.
☆47 · Updated 4 years ago
Alternatives and similar repositories for data_augmentation_for_asr
Users who are interested in data_augmentation_for_asr are comparing it to the repositories listed below.
- Clustering-based methods for overlapping diarization ☆82 · Updated 2 years ago
- Predicts the level of noise and reverberation on your audio files ☆174 · Updated 7 months ago
- End-to-End Keyword Spotting (E2E-KWS) using a character-level LSTM ☆42 · Updated 3 years ago
- High-Fidelity Neural Phonetic Posteriorgrams ☆121 · Updated 10 months ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv… ☆151 · Updated 7 months ago
- A simple package for Guided source separation (GSS) ☆132 · Updated last year
- A toolkit to calculate speech audio quality. Not affiliated with the original authors ☆65 · Updated last year
- ☆54 · Updated 2 years ago
- Online streaming speaker change detection model in PyTorch ☆44 · Updated 2 years ago
- Multilingual speech aligner ☆76 · Updated 2 years ago
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic… ☆54 · Updated 5 months ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆76 · Updated 4 years ago
- ☆66 · Updated last year
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning ☆96 · Updated last year
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation… ☆74 · Updated last year
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021 ☆71 · Updated 4 years ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR ☆58 · Updated 11 months ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆56 · Updated 7 months ago
- A speaker embedding network in PyTorch that is very quick to set up and use for whatever purposes. ☆90 · Updated 9 months ago
- Streaming Audiotransformers for online Audio tagging ☆49 · Updated last year
- PyPI-installable TDNN and TDNN-F layers for PyTorch-based acoustic model training ☆41 · Updated 5 years ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding ☆23 · Updated last year
- Production first, nn-based on-device signal processing toolkit. ☆65 · Updated 2 years ago
- Adapting Self-Supervised Models to Multi-Talker Speech Recognition Using Speaker Embeddings ☆33 · Updated 2 years ago
- A list of papers for child ASR ☆50 · Updated last year
- VoxLingua107 recipe for SpeechBrain ☆13 · Updated 4 years ago
- Python toolkit for speech processing ☆72 · Updated 3 weeks ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆63 · Updated 2 years ago
- Expressive Anechoic Recordings of Speech (EARS) ☆206 · Updated last year
- ☆14 · Updated 3 years ago