hwanyyy / preprocessing-of-speech
VAD + resampling | High resolution spectrogram
☆13 Updated last year
Related projects
Alternatives and complementary repositories for preprocessing-of-speech
- MultiSV: scripts for data preparation ☆25 Updated last week
- Recipe for LibriPhrase ☆23 Updated last year
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present… ☆24 Updated 2 years ago
- Discriminative Training of VBx Diarization ☆18 Updated last month
- ☆50 Updated last year
- ☆37 Updated 2 years ago
- PyTorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020) ☆73 Updated 4 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification ☆19 Updated last year
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020 ☆22 Updated 4 years ago
- Fast SpecAugmentation code with numpy and scipy ☆30 Updated 5 years ago
- Speech enhancement (Interspeech 2016, Ideal) ☆19 Updated 2 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl… ☆70 Updated 2 years ago
- RASTA-PLP and MFCC tool based on rasta-mat ☆33 Updated 2 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information" ☆43 Updated 5 years ago
- PyTorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi) ☆26 Updated 3 years ago
- Python code for training and testing of GMM-UBM and maximum a posteriori (MAP) adaptation based speaker verification ☆19 Updated 4 years ago
- Baseline for the Spoofing-aware Speaker Verification Challenge 2022 ☆64 Updated 2 years ago
- A simple package for Guided source separation (GSS) ☆107 Updated 6 months ago
- A statistical model-based Speech Enhancement Using MMSE-STSA ☆74 Updated 6 years ago
- ☆21 Updated 3 weeks ago
- Contains code for Deep Self Supervised Hierarchical Clustering for Speaker Diarization ☆16 Updated 2 years ago
- ☆59 Updated 4 years ago
- Dual-Path RNN for Single-Channel Speech Separation (in Keras-Tensorflow2) ☆34 Updated 4 years ago
- Augmentation adversarial training for self-supervised speaker recognition ☆77 Updated 3 years ago
- Speech Enhancement Metrics (PESQ, CSIG, CBAK, COVL) ☆63 Updated 4 years ago
- Audio Only Speech Enhancement using Unet ☆9 Updated 3 years ago
- A list of papers for child ASR ☆26 Updated last month
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition" ☆17 Updated 3 years ago
- A PyTorch implementation of Conv-TasNet ☆46 Updated 4 years ago
- A state-of-the-art time-domain network for speech separation that also performs well on speech enhancement and music separation ☆42 Updated 5 years ago