A summary of speech data augment algorithms
☆69Jan 12, 2021Updated 5 years ago
Alternatives and similar repositories for speech_data_augment
Users that are interested in speech_data_augment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of "DCCRN-Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" by pytorch.☆14Apr 4, 2023Updated 2 years ago
- Cross-Layer Similarity Knowledge Distillation for Speech Enhancement☆11Jun 22, 2023Updated 2 years ago
- Dual-Path Attention and Recurrent Network for speech separation☆19Sep 12, 2024Updated last year
- ☆16Apr 24, 2021Updated 4 years ago
- A curated list of awesome Voiceprint Recognition papers☆18Jul 9, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆23Dec 14, 2023Updated 2 years ago
- ☆28Jun 1, 2023Updated 2 years ago
- Efficient Speech Processing Tookit for Automatic Speaker Recognition☆17Feb 8, 2023Updated 3 years ago
- ☆20Mar 2, 2022Updated 4 years ago
- Instantaneous PSD estimation for speech enhancement based on generalized principal components.☆12Jul 1, 2020Updated 5 years ago
- Official GitHub repository for paper "SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Informa…☆23Aug 14, 2025Updated 7 months ago
- Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024☆50Oct 14, 2025Updated 5 months ago
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆43May 23, 2023Updated 2 years ago
- Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'☆15Jan 20, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- VAD + resampling | High resolution spectrogram☆14Nov 29, 2022Updated 3 years ago
- Nested U-Net with two-level skip connections for speech enhancement☆36Dec 18, 2023Updated 2 years ago
- Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)☆73May 11, 2024Updated last year
- ASR project with pytorch-lightning☆20Mar 21, 2025Updated last year
- ☆13Oct 27, 2021Updated 4 years ago
- 用于机器学习的语音特征提取,包含FBank和MFCC等,原理讲解和step by step的实现☆53May 17, 2019Updated 6 years ago
- In defence of metric learning for speaker recognition☆1,164Mar 26, 2024Updated 2 years ago
- ☆11Sep 16, 2020Updated 5 years ago
- ☆34Apr 11, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆18May 27, 2020Updated 5 years ago
- [Research] Monaural Speech Enhancement through Wave-U-Net (SEWUNet)☆31Nov 22, 2022Updated 3 years ago
- ☆158Jan 30, 2024Updated 2 years ago
- Mining effective negative training samples for keyword spotting (PyTorch)☆64May 23, 2020Updated 5 years ago
- radiomixer☆14Feb 16, 2022Updated 4 years ago
- pre-process script for timit data for dnn-aec works☆37Mar 3, 2022Updated 4 years ago
- BEGANSing - Korean SVS + SVC + AudioSR☆11Feb 17, 2024Updated 2 years ago
- Learning Domain-Invariant Transformation for Speaker Verification.☆11Jun 13, 2023Updated 2 years ago
- Repository for the paper "Towards duration robust weakly supervised sound event detection"☆23Aug 3, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆10Dec 22, 2023Updated 2 years ago
- Probabilistic Spherical Discriminant Analysis☆12Oct 29, 2022Updated 3 years ago
- 《声纹技术:从核心算法到工程实践》☆177Sep 12, 2022Updated 3 years ago
- 针对CN-Celeb数据集的基于ECAPA-TDNN的说话人识别的pytorch实现☆13Apr 3, 2023Updated 2 years ago
- Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)☆75Sep 16, 2020Updated 5 years ago
- ☆16Jun 15, 2022Updated 3 years ago
- ☆15Nov 5, 2021Updated 4 years ago