maum-ai/voicefilter

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/maum-ai/voicefilter)

maum-ai / voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system

☆1,214

Alternatives and similar repositories for voicefilter

Users that are interested in voicefilter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Edresson / VoiceSplit
View on GitHub
VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram
☆271Jul 25, 2024Updated last year
funcwj / voice-filter
View on GitHub
A unofficial Pytorch implementation of Google's VoiceFilter
☆105Jul 6, 2023Updated 3 years ago
kaituoxu / Conv-TasNet
View on GitHub
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permuta…
☆771Apr 6, 2023Updated 3 years ago
asteroid-team / asteroid
View on GitHub
The PyTorch-based audio source separation toolkit for researchers
☆2,577May 13, 2026Updated 2 months ago
xuchenglin28 / speaker_extraction
View on GitHub
target speaker extraction and verification for multi-talker speech
☆210Jan 24, 2021Updated 5 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
JusperLee / Speech-Separation-Paper-Tutorial
View on GitHub
A must-read paper for speech separation based on neural networks
☆951Aug 11, 2025Updated 11 months ago
gemengtju / SpEx_Plus
View on GitHub
SpEx+(tied) source code
☆96Jul 6, 2023Updated 3 years ago
anicolson / DeepXi
View on GitHub
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
☆523Feb 17, 2022Updated 4 years ago
JorisCos / LibriMix
View on GitHub
An open source dataset for source separation
☆499Feb 9, 2024Updated 2 years ago
aishoot / LSTM_PIT_Speech_Separation
View on GitHub
Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.
☆311Jan 6, 2022Updated 4 years ago
speechLabBcCuny / onssen
View on GitHub
An open-source speech separation and enhancement library
☆214May 13, 2020Updated 6 years ago
aliutkus / speechmetrics
View on GitHub
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
☆1,050Jul 5, 2023Updated 3 years ago
funcwj / setk
View on GitHub
Tools for Speech Enhancement integrated with Kaldi
☆431Jul 6, 2023Updated 3 years ago
google / speaker-id
View on GitHub
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…
☆453Aug 12, 2025Updated 11 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jain-abhinav02 / VoiceFilter
View on GitHub
Unofficial Keras implementation of Google AI VoiceFilter
☆43Mar 25, 2023Updated 3 years ago
HarryVolek / PyTorch_Speaker_Verification
View on GitHub
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
☆598Jan 20, 2022Updated 4 years ago
chenzhuo1011 / libri_css
View on GitHub
Libri-CSS: dataset and evaluation pipeline
☆157Jan 18, 2023Updated 3 years ago
funcwj / conv-tasnet
View on GitHub
A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" (see recipes in aps framework https:/…
☆219Jul 6, 2023Updated 3 years ago
ShiZiqiang / dual-path-RNNs-DPRNNs-based-speech-separation
View on GitHub
A PyTorch implementation of dual-path RNNs (DPRNNs) based speech separation described in "Dual-path RNN: efficient long sequence modeling…
☆182Aug 5, 2020Updated 5 years ago
fgnt / nara_wpe
View on GitHub
Different implementations of "Weighted Prediction Error" for speech dereverberation
☆566Mar 19, 2025Updated last year
naplab / Conv-TasNet
View on GitHub
☆337Feb 28, 2020Updated 6 years ago
AppleHolic / source_separation
View on GitHub
Deep learning based speech source separation using Pytorch
☆319Nov 20, 2020Updated 5 years ago
gemengtju / Tutorial_Separation
View on GitHub
This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly i…
☆483Jan 9, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
microsoft / DNS-Challenge
View on GitHub
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
☆1,445Jul 25, 2024Updated last year
google / uis-rnn
View on GitHub
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…
☆1,588Sep 25, 2024Updated last year
seanwood / gcc-nmf
View on GitHub
Real-time GCC-NMF Blind Speech Separation and Enhancement
☆327Apr 8, 2019Updated 7 years ago
craigmacartney / Wave-U-Net-For-Speech-Enhancement
View on GitHub
Improved speech enhancement with the Wave-U-Net, a deep convolutional neural network architecture for audio source separation, implemente…
☆224Mar 24, 2023Updated 3 years ago
wq2012 / awesome-diarization
View on GitHub
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
☆1,885Jul 7, 2026Updated last week
facebookresearch / denoiser
View on GitHub
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech E…
☆1,904Mar 14, 2023Updated 3 years ago
santi-pdp / segan
View on GitHub
Speech Enhancement Generative Adversarial Network in TensorFlow
☆861Mar 24, 2023Updated 3 years ago
maum-ai / cotatron
View on GitHub
Official code for Cotatron @ INTERSPEECH 2020
☆213Jul 25, 2024Updated last year
JusperLee / Dual-Path-RNN-Pytorch
View on GitHub
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch
☆466Feb 14, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
facebookresearch / svoice
View on GitHub
We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new …
☆1,317Nov 16, 2023Updated 2 years ago
espnet / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆9,897Updated this week
facebookresearch / WavAugment
View on GitHub
A library for speech data augmentation in time-domain
☆689Aug 30, 2021Updated 4 years ago
fgnt / sms_wsj
View on GitHub
SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition
☆131Jun 7, 2024Updated 2 years ago
ujscjj / DPTNet
View on GitHub
☆119Jan 8, 2021Updated 5 years ago
JusperLee / Conv-TasNet
View on GitHub
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement
☆549May 26, 2023Updated 3 years ago
yluo42 / TAC
View on GitHub
transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
☆308Jun 15, 2021Updated 5 years ago