nmd2k / speech-enhancement
An attempt at Vietnamese speech enhancement with U-Net and a U-Net-based ResNet
☆21Updated 3 years ago
Alternatives and similar repositories for speech-enhancement
Users that are interested in speech-enhancement are comparing it to the libraries listed below
- Fine-tune Wav2vec 2.0 for Speech Recognition☆138Updated 6 months ago
- Official PyTorch Implementation of CleanUNet (ICASSP 2022)☆329Updated last year
- A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorc…☆331Updated 4 years ago
- Conformer-based Metric GAN for speech enhancement☆376Updated last year
- Voice Activity Detection (VAD) using deep learning.☆197Updated 5 years ago
- Simple d-vector based Speaker Recognition (verification and identification) using PyTorch☆211Updated 5 years ago
- Source code for the paper titled "Speech Denoising without Clean Training Data: a Noise2Noise Approach". Paper accepted at the INTERSPEE…☆197Updated 2 years ago
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It achieves state-of-the-art results on the Speech Commands datasets V1 and V2.☆108Updated 2 years ago
- Learning Efficient Representations for Keyword Spotting with Triplet Loss☆111Updated 2 years ago
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement☆416Updated 3 months ago
- PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."☆574Updated 2 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆284Updated last year
- Implements Wave-U-Net in PyTorch and adapts it to speech enhancement.☆336Updated 2 years ago
- Voice Activity Detection based on Deep Learning & TensorFlow☆368Updated 2 years ago
- Transform-average-concatenate (TAC) method for end-to-end ad-hoc beamforming that is invariant to microphone permutation and number.☆283Updated 4 years ago
- The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".☆266Updated last month
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆217Updated 2 years ago
- An open source dataset for source separation☆440Updated last year
- End-to-End Neural Diarization☆406Updated 4 years ago
- Speech-to-text with self-supervised learning based on the wav2vec 2.0 framework☆383Updated 3 years ago
- ☆445Updated last year
- Variational Bayes HMM over x-vectors diarization☆275Updated last year
- The Microsoft Scalable Noisy Speech Dataset (MS-SNSD) is a noisy speech dataset that can scale to arbitrary sizes depending on the number…☆544Updated last year
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…☆101Updated 2 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆427Updated 3 weeks ago
- Deep learning for audio denoising☆727Updated last year
- Tools for Speech Enhancement integrated with Kaldi☆420Updated 2 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆177Updated 8 months ago
- PyTorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆26Updated 4 years ago
- The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation☆297Updated 8 months ago