freds0 / data_augmentation_for_asr
A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.
☆36Updated 3 years ago
Alternatives and similar repositories for data_augmentation_for_asr:
Users that are interested in data_augmentation_for_asr are comparing it to the libraries listed below
- Clustering-based methods for overlapping diarization☆74Updated last year
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆47Updated this week
- multilingual speech aligner☆73Updated last year
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- A list of papers for child ASR☆35Updated 3 months ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆18Updated 5 months ago
- ☆21Updated 5 months ago
- ConMamba for Automatic Speech Recognition☆54Updated 5 months ago
- Streaming Audiotransformers for online Audio tagging☆43Updated 7 months ago
- An implementation for Frame-level Speech Signal-to-Noise Ratio Estimation using deep learning☆35Updated 2 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 3 years ago
- A simple package for Guided source separation (GSS)☆112Updated 8 months ago
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated 3 months ago
- ☆16Updated 2 years ago
- ☆25Updated 5 months ago
- ☆29Updated 3 years ago
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆29Updated 3 months ago
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆27Updated 6 months ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆46Updated last month
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 2 years ago
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆48Updated this week
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆54Updated 4 months ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆39Updated 4 years ago
- Discriminative Training of VBx Diarization☆22Updated 4 months ago
- End-to-end diarization loss☆22Updated 3 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated last year
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆67Updated 3 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆73Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆41Updated 3 years ago