Audio-WestlakeU / FS-EEND
The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors" (ICASSP 2024) and "LS-EEND: long-form streaming end-to-end neural diarization with online attractor extraction".
☆126 Updated last month
Alternatives and similar repositories for FS-EEND:
Users who are interested in FS-EEND are comparing it to the repositories listed below:
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings… ☆82 Updated 2 months ago
- CHiME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar… ☆77 Updated 10 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆81 Updated last year
- Official repository of NeXt-TDNN for speaker verification ☆70 Updated 5 months ago
- Target Speaker Extraction Toolkit ☆155 Updated 3 weeks ago
- A simple package for Guided source separation (GSS) ☆118 Updated 10 months ago
- Clustering-based methods for overlapping diarization ☆80 Updated last year
- A PyTorch implementation of End-to-End Neural Diarization ☆104 Updated last year
- Python package for combining diarization system outputs. ☆87 Updated last year
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models. ☆91 Updated 7 months ago
- A PyTorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding" ☆56 Updated 6 months ago
- MeetEval - A meeting transcription evaluation toolkit ☆90 Updated this week
- Official Repository For VoxBlink2 ☆64 Updated 7 months ago
- Reference-aware automatic speech evaluation toolkit ☆145 Updated 3 months ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR ☆51 Updated last month
- This repository surveys papers focusing on prompting and adapters for speech processing. ☆108 Updated last year
- wav2vec2 audio classification for prosodic boundary detection and other tasks ☆40 Updated last year
- A repo containing download guidance and corresponding scripts for the VoxBlink dataset. ☆25 Updated 11 months ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024) ☆186 Updated 3 months ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl… ☆76 Updated 2 years ago
- UT-Sarulab MOS prediction system using SSL models ☆218 Updated 11 months ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021 ☆66 Updated 3 years ago
- StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation ☆209 Updated 6 months ago