bsxfan / Toroidal-PSDA

A probabilistic scoring backend for length-normalized embeddings.

☆10

Related projects

Alternatives and complementary repositories for Toroidal-PSDA

talhanai / wer-sigtest
Script to perform a statistical significance test between ASR hypotheses.
☆21 Updated 7 years ago
popcornell / SparseLibriMix
☆55 Updated 3 years ago
luferrer / ConfidenceIntervals
Confidence interval computation for evaluation in machine learning using the bootstrapping approach
☆66 Updated 7 months ago
mispchallenge / misp2022_baseline
☆26 Updated last year
nobutaka-ito / pulse
Official PyTorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)
☆39 Updated last year
Crystalsound / FRN
☆26 Updated last year
dhimasryan / MOSA-Net-Cross-Domain
☆48 Updated 5 months ago
BUTSpeechFIT / EEND_dataprep
☆49 Updated 6 months ago
fgnt / mms_msg
Multipurpose Multi Speaker Mixture Signal Generator
☆43 Updated last month
wyw97 / DENSE
ICASSP 2025: Dynamic Embedding Causal Target Speech Extraction
☆29 Updated last month
khhungg / BSSE-SE
Boosting Self-Supervised Embeddings for Speech Enhancement
☆45 Updated 2 years ago
haoxiangsnr / llm-tse
Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)
☆32 Updated last year
HaoFengyuan / X-TF-GridNet
The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…
☆36 Updated last month
archiki / Robust-E2E-ASR
This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…
☆46 Updated 3 years ago
haoheliu / diffres-python
Learning differentiable temporal resolution on time-series data.
☆33 Updated 2 years ago
desh2608 / gss
A simple package for Guided source separation (GSS)
☆107 Updated 6 months ago
BUTSpeechFIT / AMI-diarization-setup
☆50 Updated last year
haidog-yaqub / DPMTSE
A Diffusion Probabilistic Model for Target Sound Extraction
☆35 Updated last month
Andong-Li-speech / TaylorSENet
This is the implementation of the paper "Taylor, Can You Hear Me Now? A Taylor-Unfolding Framework for Monaural Speech Enhancement", wh…
☆63 Updated 2 years ago
Neclow / SERAB
SERAB: a multi-lingual benchmark for speech emotion recognition
☆28 Updated last year
microsoft / NOTSOFAR1-Challenge
NOTSOFAR-1 Challenge: Distant Diarization and ASR
☆44 Updated last week
HuangZiliAndy / SSL_for_multitalker
Adapting Self-Supervised Models to Multi-Talker Speech Recognition Using Speaker Embeddings
☆27 Updated last year
sp-uhh / sgmse-bbed
TODO
☆35 Updated last year
iiscleap / self_supervised_AHC
Contains code for Deep Self-Supervised Hierarchical Clustering for Speaker Diarization
☆16 Updated 2 years ago
sinhat98 / adapter-wavlm
☆43 Updated last year
BUTSpeechFIT / EEND
☆74 Updated 3 months ago
cogmhear / avse_challenge
COG-MHEAR Audio-Visual Speech Enhancement Challenge
☆34 Updated 8 months ago
kamo-naoyuki / pytorch_complex
A temporal module for PyTorch-ComplexTensor
☆45 Updated 4 months ago
desh2608 / pytorch-tdnn
PyPI-installable TDNN and TDNN-F layers for PyTorch-based acoustic model training
☆38 Updated 3 years ago
GasserElbanna / serab-byols
(Hybrid) BYOL-S feature extractor using the serab-byols package in PyTorch.
☆27 Updated 7 months ago