BUTSpeechFIT / diacorrect
Error correction back-end for speaker diarization
☆15Updated last year
Alternatives and similar repositories for diacorrect:
Users that are interested in diacorrect are comparing it to the libraries listed below
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆28Updated 3 months ago
- ☆28Updated last week
- Discriminative Training of VBx Diarization☆21Updated 3 months ago
- ☆31Updated 9 months ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆36Updated 3 weeks ago
- ☆10Updated last month
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆20Updated 4 months ago
- ☆56Updated 11 months ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆38Updated 4 months ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆21Updated 3 months ago
- This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).☆21Updated last month
- ☆23Updated this week
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆36Updated last year
- A toolkit dedicate for speech evaluation.☆19Updated 3 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆15Updated 3 months ago
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆11Updated last month
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆13Updated 2 years ago
- Clustering-based methods for overlapping diarization☆74Updated last year
- The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura…☆11Updated last year
- Streaming Audiotransformers for online Audio tagging☆43Updated 7 months ago
- ☆30Updated last year
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated 3 months ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆24Updated 9 months ago
- ConMamba for Automatic Speech Recognition☆53Updated 5 months ago
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆58Updated last month
- This is the official implementation of the LiSenNet☆33Updated 2 months ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆54Updated 4 months ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆34Updated last year
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆21Updated last month