A summary of speech data augment algorithms
☆69Jan 12, 2021Updated 5 years ago
Alternatives and similar repositories for speech_data_augment
Users that are interested in speech_data_augment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" by pytorch.☆14Apr 4, 2023Updated 3 years ago
- Cross-Layer Similarity Knowledge Distillation for Speech Enhancement☆11Jun 22, 2023Updated 2 years ago
- ☆16Apr 24, 2021Updated 4 years ago
- A curated list of awesome Voiceprint Recognition papers☆19Jul 9, 2021Updated 4 years ago
- Dual-Path Attention and Recurrent Network for speech separation☆21Sep 12, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆23Dec 14, 2023Updated 2 years ago
- ☆28Jun 1, 2023Updated 2 years ago
- Efficient Speech Processing Tookit for Automatic Speaker Recognition☆17Feb 8, 2023Updated 3 years ago
- ☆20Mar 2, 2022Updated 4 years ago
- Instantaneous PSD estimation for speech enhancement based on generalized principal components.☆12Jul 1, 2020Updated 5 years ago
- Official GitHub repository for paper "SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Informa…☆24Aug 14, 2025Updated 8 months ago
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆43May 23, 2023Updated 2 years ago
- Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024☆52Oct 14, 2025Updated 6 months ago
- Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'☆15Jan 20, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- VAD + resampling | High resolution spectrogram☆14Nov 29, 2022Updated 3 years ago
- Nested U-Net with two-level skip connections for speech enhancement☆36Dec 18, 2023Updated 2 years ago
- Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)☆73May 11, 2024Updated last year
- ASR project with pytorch-lightning☆20Mar 21, 2025Updated last year
- In defence of metric learning for speaker recognition☆1,163Updated this week
- ☆11Sep 16, 2020Updated 5 years ago
- ☆34Apr 11, 2024Updated 2 years ago
- [Research] Monaural Speech Enhancement through Wave-U-Net (SEWUNet)☆31Nov 22, 2022Updated 3 years ago
- Mining effective negative training samples for keyword spotting (PyTorch)☆65May 23, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- radiomixer☆14Feb 16, 2022Updated 4 years ago
- pre-process script for timit data for dnn-aec works☆37Mar 3, 2022Updated 4 years ago
- BEGANSing - Korean SVS + SVC + AudioSR☆11Feb 17, 2024Updated 2 years ago
- Learning Domain-Invariant Transformation for Speaker Verification.☆11Jun 13, 2023Updated 2 years ago
- ☆160Jan 9, 2023Updated 3 years ago
- Repository for the paper "Towards duration robust weakly supervised sound event detection"☆23Aug 3, 2023Updated 2 years ago
- ☆10Dec 22, 2023Updated 2 years ago
- Probabilistic Spherical Discriminant Analysis☆12Oct 29, 2022Updated 3 years ago
- 针对CN-Celeb数据集的基于ECAPA-TDNN的说话人识别的pytorch实现☆13Apr 3, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)☆75Sep 16, 2020Updated 5 years ago
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆655Apr 5, 2022Updated 4 years ago
- ☆16Jun 15, 2022Updated 3 years ago
- ☆15Nov 5, 2021Updated 4 years ago
- PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and …☆149Jan 6, 2020Updated 6 years ago
- This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.☆1,403Jul 25, 2024Updated last year
- ☆478Oct 12, 2023Updated 2 years ago