lawlict / EEND_loss

End-to-end diarization loss

☆22

Alternatives and similar repositories for EEND_loss:

Users who are interested in EEND_loss are comparing it to the libraries listed below.

fgnt / mms_msg
Multipurpose Multi Speaker Mixture Signal Generator
☆44 Updated last week
desh2608 / css
PyTorch implementation of Continuous Speech Separation
☆13 Updated 2 years ago
desh2608 / pytorch-tdnn
PyPI-installable TDNN and TDNN-F layers for PyTorch-based acoustic model training
☆39 Updated 4 years ago
haoheliu / DCASE_2022_Task_5
System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection
☆28 Updated 2 years ago
luomingshuang / k2-speechbrain
In this repository, I try to combine k2 with speechbrain to decode accurately and quickly.
☆16 Updated 2 years ago
wangkenpu / WSJ2WAV
Convert WSJ sphere format to waveform and do data simulation.
☆16 Updated 4 years ago
ICASSP2021-tutorial9 / Distant_conversational_ASR_and_analysis
☆12 Updated 3 years ago
csukuangfj / kaldi-hmm-gmm
☆25 Updated 3 months ago
tvuong123 / ModulationDomainLoss
Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021
☆38 Updated 3 years ago
kamo-naoyuki / pytorch_complex
A temporal module for PyTorch-ComplexTensor
☆45 Updated 7 months ago
popcornell / SparseLibriMix
☆56 Updated 4 years ago
X-LANCE / BER
Balanced Error Rate for Speaker Diarization
☆29 Updated last year
JorisCos / VCTK-2Mix
☆15 Updated 4 years ago
popcornell / OSDC
☆16 Updated 4 years ago
FantSun / Speechflow
Speechflow for emotion recognition related information decomposition
☆10 Updated 3 years ago
dr-pato / SSGD
Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"
☆13 Updated 2 years ago
chimechallenge / chime-utils
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
☆21 Updated 2 months ago
popcornell / MicRank
MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.
☆22 Updated 3 years ago
schufo / tisms
This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"
☆15 Updated 10 months ago
nwpuaslp / ASC_baseline
☆20 Updated 4 years ago
shanguanma / Aligners
HMM, CTC, RNN-Transducer, forward-backward algorithm
☆21 Updated last year
nii-yamagishilab / Intelligibility-MetricGAN
Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…
☆54 Updated last year
alumae / torch-xvectors-wav
☆22 Updated 3 years ago
desh2608 / diarizer
Clustering-based methods for overlapping diarization
☆75 Updated last year
speechio / asr-noises
A handy dataset of noises for ASR
☆19 Updated 5 years ago
asteroid-team / pytorch-pit
Permutation invariant training in PyTorch
☆13 Updated 4 years ago
RicherMans / UIT_Mobile
Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"
☆23 Updated last year
hbredin / DomainAdversarialVoiceActivityDetection
Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"
☆23 Updated 4 years ago
fgnt / paderbox
Paderbox: A collection of utilities for audio / speech processing
☆38 Updated 7 months ago
wentaozhu / speechnas
SpeechNAS: Better Trade-off between Latency and Accuracy for Large-Scale Speaker Verification
☆30 Updated last year