zzpDapeng/speech_data_augment

A summary of speech data augmentation algorithms

☆69

Alternatives and similar repositories for speech_data_augment

Users that are interested in speech_data_augment are comparing it to the libraries listed below.

Wang-Jingrun / DCCRN
Implementation of "DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement" in PyTorch.
☆15 · Apr 4, 2023 · Updated 3 years ago
KhanhNguyen4999 / Speech-Enhancement-CLSKD
Cross-Layer Similarity Knowledge Distillation for Speech Enhancement
☆11 · Jun 22, 2023 · Updated 3 years ago
ssprl / Real-time-Blind-source-separation-using-IVA
☆16 · Apr 24, 2021 · Updated 5 years ago
nonday / awesome-voiceprint
A curated list of awesome Voiceprint Recognition papers
☆19 · Jul 9, 2021 · Updated 5 years ago
yangyi0818 / DPARNet
Dual-Path Attention and Recurrent Network for speech separation
☆22 · Sep 12, 2024 · Updated last year
wngh1187 / Diff-SV
PyTorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…
☆23 · Dec 14, 2023 · Updated 2 years ago
dangf15 / DPT-FSNet
☆28 · Jun 1, 2023 · Updated 3 years ago
mechanicalsea / sugar
Efficient Speech Processing Toolkit for Automatic Speaker Recognition
☆17 · Feb 8, 2023 · Updated 3 years ago
sc0ttms / SE-DCCRN
☆22 · Mar 2, 2022 · Updated 4 years ago
tdietzen / INST-PSD
Instantaneous PSD estimation for speech enhancement based on generalized principal components.
☆11 · Jul 1, 2020 · Updated 6 years ago
spkgyk / RTFS-Net
Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted at ICLR 2024
☆51 · Oct 14, 2025 · Updated 9 months ago
YUCHEN005 / DPSL-ASR
Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"
☆44 · May 23, 2023 · Updated 3 years ago
seorim0 / NUNet-TLS
Nested U-Net with two-level skip connections for speech enhancement
☆38 · Dec 18, 2023 · Updated 2 years ago
Liu-Tianchi / Golden-Gemini-for-Speaker-Verification
Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'
☆15 · Jan 20, 2025 · Updated last year
L6-NLP / Generative-Annotation-NEC
Generative_Annotation_NEC: A novel NEC method that utilizes speech sound features to retrieve candidate entities and a generative method …
☆17 · Dec 2, 2025 · Updated 7 months ago
hwanyyy / preprocessing-of-speech
VAD + resampling | High resolution spectrogram
☆14 · Nov 29, 2022 · Updated 3 years ago
tongjinle123 / speech-transformer-pytorch_lightning
ASR project with PyTorch Lightning
☆20 · Mar 21, 2025 · Updated last year
RoyChao19477 / PCS
Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)
☆73 · May 11, 2024 · Updated 2 years ago
Magic-Bubble / SpeechProcessForMachineLearning
Speech feature extraction for machine learning, including FBank and MFCC, with explanations of the principles and step-by-step implementations
☆54 · May 17, 2019 · Updated 7 years ago
PunkMale / ECAPA-TDNN-CNCeleb
PyTorch implementation of ECAPA-TDNN-based speaker recognition for the CN-Celeb dataset
☆13 · Apr 3, 2023 · Updated 3 years ago
clovaai / voxceleb_trainer
In defence of metric learning for speaker recognition
☆1,170 · Apr 22, 2026 · Updated 3 months ago
Hguimaraes / SEWUNet
[Research] Monaural Speech Enhancement through Wave-U-Net (SEWUNet)
☆32 · Nov 22, 2022 · Updated 3 years ago
YongyuG / dnn_aec_data_process
Preprocessing script for TIMIT data for DNN-AEC work
☆38 · Mar 3, 2022 · Updated 4 years ago
cszheng-ioa / Sixty-years-of-frequency-domain-monaural-speech-enhancement
☆161 · Jan 30, 2024 · Updated 2 years ago
jingyonghou / KWS_Max-pooling_RHE
Mining effective negative training samples for keyword spotting (PyTorch)
☆66 · May 23, 2020 · Updated 6 years ago
levtelyatnikov / radiomixer
radiomixer
☆14 · Feb 16, 2022 · Updated 4 years ago
ORI-Muchim / BEGANSing
BEGANSing - Korean SVS + SVC + AudioSR
☆11 · Feb 17, 2024 · Updated 2 years ago
JasonZhang156 / Sound-Recognition-Tutorial
A simple sound recognition tutorial, including data analysis, feature extraction, model building, model training, and model testing ...
☆93 · May 24, 2019 · Updated 7 years ago
MiukkaZh / MGT
Learning Domain-Invariant Transformation for Speaker Verification.
☆11 · Jun 13, 2023 · Updated 3 years ago
zyzisyz / mfa_conformer
☆160 · Jan 9, 2023 · Updated 3 years ago
bsxfan / PSDA
Probabilistic Spherical Discriminant Analysis
☆12 · Oct 29, 2022 · Updated 3 years ago
zds-potato / multilingual-phonetic-sv
☆10 · Dec 22, 2023 · Updated 2 years ago
seongmin-kye / meta-SR
PyTorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)
☆73 · Sep 16, 2020 · Updated 5 years ago
RicherMans / CDur
Repository for the paper "Towards duration robust weakly supervised sound event detection"
☆23 · Aug 3, 2023 · Updated 2 years ago
nishithbsk / ConflictPrediction
Predicting Political Instability and Social Conflicts Using Multimodal Data
☆10 · Jun 6, 2016 · Updated 10 years ago
Mleader2 / bert_music_correct
Intent recognition, slot filling, and slot-value correction model for music-domain corpora
☆19 · Mar 24, 2023 · Updated 3 years ago
rrbluke / CNBF
Complex Neural Beamformer
☆33 · Oct 15, 2020 · Updated 5 years ago
DemisEom / SpecAugment
An implementation of SpecAugment, introduced by Google Brain, with TensorFlow & PyTorch
☆655 · Apr 5, 2022 · Updated 4 years ago
cvqluu / Factorized-TDNN
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and …
☆149 · Jan 6, 2020 · Updated 6 years ago