Edresson / VoiceSplit
VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram
☆222 · Updated 3 months ago
Related projects
Alternatives and complementary repositories for VoiceSplit
- Speaker embedding (d-vector) trained with GE2E loss ☆273 · Updated 10 months ago
- Tools for Speech Enhancement integrated with Kaldi ☆397 · Updated last year
- An open source dataset for source separation ☆378 · Updated 9 months ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion" ☆344 · Updated 3 months ago
- Variational Bayes HMM over x-vectors diarization ☆252 · Updated 9 months ago
- HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks ☆208 · Updated 3 years ago
- Official repository for RawNet, RawNet2, and RawNet3 ☆360 · Updated 7 months ago
- General Speech Restoration ☆276 · Updated 9 months ago
- Target speaker extraction and verification for multi-talker speech ☆163 · Updated 3 years ago
- Diarization scoring tools. ☆217 · Updated last year
- Predicts the level of noise and reverberation in your audio files ☆138 · Updated 5 months ago
- The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement". ☆242 · Updated 6 months ago
- Voice Activity Detection (VAD) using deep learning. ☆191 · Updated 5 years ago
- Conformer-based Metric GAN for speech enhancement ☆311 · Updated 6 months ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv… ☆125 · Updated 2 weeks ago
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement ☆312 · Updated last week
- Python implementation of the Short Term Objective Intelligibility measure ☆326 · Updated 10 months ago
- PPG-Based Voice Conversion ☆328 · Updated 2 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo! ☆337 · Updated 2 years ago
- Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-R… ☆308 · Updated last year
- Voice Activity Detection based on Deep Learning & TensorFlow ☆354 · Updated last year
- A PyTorch implementation of Wave-U-Net, adapted to speech enhancement. ☆323 · Updated 2 years ago
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It achieves state-of-the-art results on the Speech Commands dataset V1 and V2. ☆99 · Updated last year
- ESPnet Model Zoo ☆245 · Updated last year
- End-to-End Neural Diarization ☆371 · Updated 3 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196 ☆306 · Updated 3 years ago
- A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR ☆904 · Updated last year
- A python package for calculating the PESQ. ☆355 · Updated last year
- Unofficial implementation of PercepNet: A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech ☆333 · Updated last year
- A minimal unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorc… ☆315 · Updated 4 years ago