A summary of speech data augmentation algorithms
☆69 · Jan 12, 2021 · Updated 5 years ago
Alternatives and similar repositories for speech_data_augment
Users that are interested in speech_data_augment are comparing it to the libraries listed below
- Cross-Layer Similarity Knowledge Distillation for Speech Enhancement ☆11 · Jun 22, 2023 · Updated 2 years ago
- Implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" by pytorch. ☆14 · Apr 4, 2023 · Updated 2 years ago
- radiomixer ☆14 · Feb 16, 2022 · Updated 4 years ago
- ☆16 · Apr 24, 2021 · Updated 4 years ago
- Official GitHub repository for paper "SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Informa… ☆22 · Aug 14, 2025 · Updated 6 months ago
- ☆11 · Sep 16, 2020 · Updated 5 years ago
- Dual-Path Attention and Recurrent Network for speech separation ☆19 · Sep 12, 2024 · Updated last year
- Efficient Speech Processing Toolkit for Automatic Speaker Recognition ☆17 · Feb 8, 2023 · Updated 3 years ago
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition" ☆43 · May 23, 2023 · Updated 2 years ago
- A curated list of awesome Voiceprint Recognition papers ☆18 · Jul 9, 2021 · Updated 4 years ago
- Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024 ☆49 · Oct 14, 2025 · Updated 4 months ago
- ☆18 · May 27, 2020 · Updated 5 years ago
- ☆33 · Apr 11, 2024 · Updated last year
- ☆53 · Jan 15, 2021 · Updated 5 years ago
- Implementation of paper "DPCRN: Dual-Path Convolution Recurrent Network for Single Channel Speech Enhancement" ☆229 · Apr 22, 2024 · Updated last year
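- Speech feature extraction for machine learning, including FBank and MFCC, with explanations of the principles and step-by-step implementations ☆53 · May 17, 2019 · Updated 6 years ago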
- Mining effective negative training samples for keyword spotting (PyTorch) ☆65 · May 23, 2020 · Updated 5 years ago
- ☆28 · Jun 1, 2023 · Updated 2 years ago
- Repository for the paper "Towards duration robust weakly supervised sound event detection" ☆23 · Aug 3, 2023 · Updated 2 years ago
- ☆158 · Jan 30, 2024 · Updated 2 years ago
- Robust Speech Activity Detection (SAD) in movie audio ☆26 · Jan 27, 2021 · Updated 5 years ago
- An example of a speech enhancement model deployed with TensorRT. ☆78 · Mar 24, 2025 · Updated 11 months ago
- In defence of metric learning for speaker recognition ☆1,165 · Mar 26, 2024 · Updated last year
- [ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech ☆25 · Apr 20, 2022 · Updated 3 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset. ☆25 · Jan 28, 2019 · Updated 7 years ago
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR". ☆89 · Feb 2, 2026 · Updated last month
- A Pytorch-based implementation of the compression and decompression module in "Ultra Dual-Path Compression For Joint Echo Cancellation An… ☆64 · Feb 20, 2024 · Updated 2 years ago
- An Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain ☆656 · Apr 5, 2022 · Updated 3 years ago
- A Simple and Efficient Implementation Of Fast Fourier Transform For Audio Denoise ☆112 · Aug 11, 2020 · Updated 5 years ago
- Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022) ☆72 · May 11, 2024 · Updated last year
- This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection). ☆282 · May 23, 2022 · Updated 3 years ago
- A research project and comparative study on various Active Noise Cancellation Algorithms like FxLMS, EMFN, Chebyshev filter and Hammerste… ☆10 · Jul 3, 2022 · Updated 3 years ago
- Keyword Search Recipe for Subword ASR ☆30 · Jul 12, 2019 · Updated 6 years ago
- Manage audio and video datasets ☆34 · Updated this week
- A training code template for DNN-based speech enhancement. ☆171 · Sep 4, 2025 · Updated 6 months ago
- This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge. ☆1,378 · Jul 25, 2024 · Updated last year
- [Research] Monaural Speech Enhancement through Wave-U-Net (SEWUNet) ☆31 · Nov 22, 2022 · Updated 3 years ago
- ☆129 · Mar 21, 2019 · Updated 6 years ago
- You can find the speech algorithms you want here ☆854 · Jan 25, 2026 · Updated last month