speech-separation-hse / TIME-DOMAIN-AUDIO-VISUAL-SPEECH-SEPARATION

PyTorch implementation

☆9

Alternatives and similar repositories for TIME-DOMAIN-AUDIO-VISUAL-SPEECH-SEPARATION

Users who are interested in TIME-DOMAIN-AUDIO-VISUAL-SPEECH-SEPARATION are comparing it to the repositories listed below.

zexupan / MuSE
☆39 Updated 7 months ago
aispeech-lab / advr-avss
Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
☆17 Updated 3 years ago
gemengtju / SpEx_Plus
SpEx+(tied) source code
☆86 Updated 2 years ago
zexupan / USEV
☆13 Updated last year
Sanyuan-Chen / CSS_with_Conformer
Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.
☆118 Updated 2 years ago
khhungg / BSSE-SE
Boosting Self-Supervised Embeddings for Speech Enhancement
☆47 Updated 3 years ago
RookieJunChen / dns_mos_calculate
Code for calculate DNS_MOS.
☆39 Updated 2 years ago
lin9x / AV-Sepformer
☆53 Updated 2 years ago
qinxiaoyi / Simple-Attention-Module-based-Speaker-Verification-with-Iterative-Noisy-Label-Detection
☆13 Updated 3 years ago
zyzisyz / mfa_conformer
☆150 Updated 2 years ago
zexupan / reentry
☆17 Updated 7 months ago
ductuantruong / enskd
Official implementation of the ICASSP 2024 paper: Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verificat…
☆16 Updated last year
wngh1187 / RawNeXt
Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…
☆25 Updated 3 years ago
phonexiaresearch / VBx-training-recipe
☆29 Updated 3 years ago
Maokui-He / NSD-MA-MSE
A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"
☆57 Updated 10 months ago
SungFeng-Huang / SSL-pretraining-separation
Official repository of our paper: https://arxiv.org/abs/2010.15366
☆63 Updated 3 years ago
TaoRuijie / AVCleanse
ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'
☆42 Updated 2 years ago
nttcslab-sp / EEND-vector-clustering
This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…
☆78 Updated 2 years ago
danmic / av-se
Deep-Learning-Based Audio-Visual Speech Enhancement and Separation
☆210 Updated 2 years ago
qinxiaoyi / Cross-Age_Speaker_Verification
☆29 Updated 2 years ago
mispchallenge / MISP-2023-Challenge-Baseline
☆25 Updated last year
xuchenglin28 / speaker_extraction_SpEx
multi-scale time domain speaker extraction
☆65 Updated 4 years ago
urgent-challenge / urgent2024_challenge
Official data preparation scripts for the URGENT 2024 Challenge
☆80 Updated last month
muqiaoy / eGeMAPS_estimator
☆24 Updated 3 years ago
chenzhuo1011 / libri_css
Libri-CSS: dataset and evaluation pipeline
☆147 Updated 2 years ago
aispeech-lab / LiMuSE
PyTorch implementation of LiMuSE
☆31 Updated 2 years ago
XiaoMi / dasheng
Official PyTorch code for Deep Audio-Signal Holistic Embeddings
☆107 Updated 2 months ago
mpariente / pywsj0-mix
wsj0-{2, 3, 4, 5} mix generation scripts, in Python.
☆61 Updated 4 years ago
yoonsanghyu / FaSNet-TAC-PyTorch
Full implementation of "End-to-end microphone permutation and number invariant multi-channel speech separation" (Interspeech 2020)
☆66 Updated 3 years ago
cogmhear / avse_challenge
COG-MHEAR Audio-Visual Speech Enhancement Challenge
☆40 Updated 2 months ago