A summary of speech data augment algorithms
☆69Jan 12, 2021Updated 5 years ago
Alternatives and similar repositories for speech_data_augment
Users that are interested in speech_data_augment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" by pytorch.☆14Apr 4, 2023Updated 3 years ago
- Cross-Layer Similarity Knowledge Distillation for Speech Enhancement☆11Jun 22, 2023Updated 2 years ago
- ☆16Apr 24, 2021Updated 5 years ago
- A curated list of awesome Voiceprint Recognition papers☆19Jul 9, 2021Updated 4 years ago
- Dual-Path Attention and Recurrent Network for speech separation☆21Sep 12, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆23Dec 14, 2023Updated 2 years ago
- ☆28Jun 1, 2023Updated 3 years ago
- Efficient Speech Processing Tookit for Automatic Speaker Recognition☆17Feb 8, 2023Updated 3 years ago
- ☆21Mar 2, 2022Updated 4 years ago
- Instantaneous PSD estimation for speech enhancement based on generalized principal components.☆11Jul 1, 2020Updated 5 years ago
- Official GitHub repository for paper "SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Informa…☆24Aug 14, 2025Updated 10 months ago
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆44May 23, 2023Updated 3 years ago
- Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024☆51Oct 14, 2025Updated 8 months ago
- Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'☆15Jan 20, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- VAD + resampling | High resolution spectrogram☆14Nov 29, 2022Updated 3 years ago
- Nested U-Net with two-level skip connections for speech enhancement☆36Dec 18, 2023Updated 2 years ago
- Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)☆73May 11, 2024Updated 2 years ago
- ASR project with pytorch-lightning☆20Mar 21, 2025Updated last year
- ☆13Oct 27, 2021Updated 4 years ago
- 用于机器学习的语音特征提取,包含FBank和MFCC等,原理讲解和step by step的实现☆54May 17, 2019Updated 7 years ago
- In defence of metric learning for speaker recognition☆1,169Apr 22, 2026Updated last month
- ☆11Sep 16, 2020Updated 5 years ago
- ☆33Apr 11, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [Research] Monaural Speech Enhancement through Wave-U-Net (SEWUNet)☆31Nov 22, 2022Updated 3 years ago
- Mining effective negative training samples for keyword spotting (PyTorch)☆65May 23, 2020Updated 6 years ago
- ☆160Jan 30, 2024Updated 2 years ago
- radiomixer☆14Feb 16, 2022Updated 4 years ago
- pre-process script for timit data for dnn-aec works☆38Mar 3, 2022Updated 4 years ago
- BEGANSing - Korean SVS + SVC + AudioSR☆11Feb 17, 2024Updated 2 years ago
- Learning Domain-Invariant Transformation for Speaker Verification.☆11Jun 13, 2023Updated 3 years ago
- ☆160Jan 9, 2023Updated 3 years ago
- Repository for the paper "Towards duration robust weakly supervised sound event detection"☆23Aug 3, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆10Dec 22, 2023Updated 2 years ago
- Probabilistic Spherical Discriminant Analysis☆12Oct 29, 2022Updated 3 years ago
- 《声纹技术:从核心算法到工程实践》☆176Sep 12, 2022Updated 3 years ago
- Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)☆74Sep 16, 2020Updated 5 years ago
- A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain☆654Apr 5, 2022Updated 4 years ago
- ☆16Jun 15, 2022Updated 4 years ago
- ☆15Nov 5, 2021Updated 4 years ago