aispeech-lab/WASE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aispeech-lab/WASE)

aispeech-lab / WASE

PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Environments"

☆27

Alternatives and similar repositories for WASE

Users that are interested in WASE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aispeech-lab / TinyWASE
View on GitHub
PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…
☆11Jun 28, 2021Updated 5 years ago
yangdongchao / Tim-TSENet
View on GitHub
The source code of Tim-TSENet
☆15Apr 22, 2022Updated 4 years ago
WangHelin1997 / LibriLightMix-WHAMR
View on GitHub
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
☆17Nov 7, 2024Updated last year
huaidanquede / Dense-TSNet
View on GitHub
offical code for Dense-TSNet
☆12Sep 17, 2024Updated last year
Aworselife / DPTBF
View on GitHub
☆17Sep 12, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
aispeech-lab / LiMuSE
View on GitHub
PyTorch implementation of LiMuSE
☆33Oct 11, 2022Updated 3 years ago
Okrio / deepvqe
View on GitHub
☆14Oct 12, 2023Updated 2 years ago
mborsdorf / UniversalSpeakerExtraction
View on GitHub
☆15Sep 6, 2021Updated 4 years ago
JonathanDZ / TF-FaSNet
View on GitHub
☆24Feb 28, 2023Updated 3 years ago
xuchenglin28 / speech_separation
View on GitHub
Constrained Permutation Invariant Training, Speech Separation
☆52Jan 24, 2021Updated 5 years ago
aispeech-lab / SDNet
View on GitHub
Pytorch implemention of SDNet
☆23Jun 1, 2021Updated 5 years ago
WangHelin1997 / Automatic_Speech_Annotator
View on GitHub
Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…
☆33Jun 14, 2024Updated 2 years ago
juhayna-zh / BSRNN-speech-preprocess
View on GitHub
A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.
☆15Aug 22, 2023Updated 2 years ago
BUTSpeechFIT / speakerbeam
View on GitHub
☆146Oct 25, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lijin0120 / CELSDS
View on GitHub
A Chinese Expressive Long-dialogue Speech Dataset with Scripts
☆21Nov 11, 2024Updated last year
aispeech-lab / advr-avss
View on GitHub
Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
☆18Jul 11, 2022Updated 4 years ago
Xiaobin-Rong / lite-rtse
View on GitHub
An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement
☆14Nov 19, 2023Updated 2 years ago
xuchenglin28 / speaker_extraction_SpEx
View on GitHub
multi-scale time domain speaker extraction
☆81Jun 7, 2021Updated 5 years ago
gemengtju / L-SpEx
View on GitHub
☆39Feb 23, 2022Updated 4 years ago
HuangZikang-TJU / Aug4TSE
View on GitHub
☆15Sep 16, 2024Updated last year
phonexiaresearch / VBx-training-recipe
View on GitHub
☆33Mar 11, 2022Updated 4 years ago
xuchenglin28 / target_speaker_verification
View on GitHub
target speaker verification (tSV), ts-vector, universal speaker verification for single- and multi-talker speech
☆15Jan 26, 2021Updated 5 years ago
qiuqiangkong / sampleRNN_acoustic_scene_generation
View on GitHub
☆14Apr 18, 2019Updated 7 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
fgnt / sms_wsj
View on GitHub
SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition
☆131Jun 7, 2024Updated 2 years ago
WangHelin1997 / GL-AT
View on GitHub
Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.
☆13Feb 6, 2021Updated 5 years ago
iiscleap / self_supervised_AHC
View on GitHub
Contains code for Deep Self Supervised Heirarchical Clustering for Speaker Diarization
☆17Dec 16, 2021Updated 4 years ago
CPJKU / dcase2024_task1_baseline
View on GitHub
☆10Jun 6, 2024Updated 2 years ago
lili-0805 / MVAE
View on GitHub
Official PyTorch implementation of MVAE for audio source separation
☆43Dec 21, 2022Updated 3 years ago
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
LiChenda / Multi-clue-TSE-data
View on GitHub
Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"
☆17May 19, 2023Updated 3 years ago
haoxiangsnr / SpEx
View on GitHub
Implementation of "SpEx: Multi-Scale Time Domain Speaker Extraction Network".
☆37Jul 19, 2020Updated 6 years ago
Sanyuan-Chen / CSS_with_Conformer
View on GitHub
Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.
☆120Mar 18, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
tencent-ailab / FRA-RIR
View on GitHub
☆214Dec 4, 2023Updated 2 years ago
fakufaku / torchiva
View on GitHub
Blind source separation with independent vector analysis family of algorithm in torch
☆108Jan 30, 2023Updated 3 years ago
yongxuUSTC / grnnbf
View on GitHub
Generalized RNN beamformer for speech separation
☆18Jan 11, 2022Updated 4 years ago
yangdongchao / Target-sound-event-detection
View on GitHub
The source code for target sound detection
☆15Feb 26, 2022Updated 4 years ago
jadfegh / audiovision
View on GitHub
Real-time Speech Separation, Noise Suppression & Speaker Recognition
☆17Apr 17, 2019Updated 7 years ago
nervjack2 / Speech2Unit
View on GitHub
☆13Sep 25, 2024Updated last year
kimsunwiub / BLOOM-Net
View on GitHub
Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"
☆14Feb 13, 2022Updated 4 years ago