nmd2k / speech-enhancement
An attempt at Vietnamese speech enhancement with U-Net and a U-Net-based ResNet
☆22Updated 4 years ago
Alternatives and similar repositories for speech-enhancement
Users that are interested in speech-enhancement are comparing it to the libraries listed below
- Official PyTorch Implementation of CleanUNet (ICASSP 2022)☆338Updated 2 years ago
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement☆449Updated 6 months ago
- Speaker embedding (d-vector) trained with GE2E loss☆286Updated last year
- PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."☆585Updated 2 years ago
- Conformer-based Metric GAN for speech enhancement☆397Updated last year
- A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorc…☆340Updated 5 years ago
- A PyTorch implementation of Wave-U-Net, adapted to speech enhancement.☆342Updated 3 years ago
- Learning Efficient Representations for Keyword Spotting with Triplet Loss☆112Updated 3 years ago
- The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".☆280Updated 4 months ago
- Fine-tune wav2vec 2.0 for speech recognition☆142Updated 10 months ago
- Voice Activity Detection (VAD) using deep learning.☆201Updated 6 years ago
- Variational Bayes HMM over x-vectors diarization☆278Updated last year
- transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.☆299Updated 4 years ago
- Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEE…☆204Updated 2 years ago
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It reports state-of-the-art results on the Speech Commands datasets V1 and V2.☆109Updated 2 years ago
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆145Updated 3 years ago
- The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation☆321Updated 11 months ago
- Diarization scoring tools.☆260Updated 2 years ago
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation☆219Updated 2 years ago
- An open source dataset for source separation☆457Updated last year
- ☆326Updated 5 years ago
- End-to-End Neural Diarization☆416Updated 4 years ago
- A PyTorch implementation of DNN-based source separation.☆306Updated 3 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆212Updated 5 years ago
- Some comprehensive papers about speaker diarization☆320Updated 6 months ago
- ☆461Updated 2 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆436Updated 3 months ago
- The official PyTorch implementation of the Interspeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"☆182Updated 2 months ago
- Tools for Speech Enhancement integrated with Kaldi☆425Updated 2 years ago
- Speech Separation☆78Updated last year