GeorgeEfstathiadis / LLM-Diarize-ASR-Agnostic
Repository for "LLM-based speaker diarization correction: A generalizable approach" paper
☆12Updated 8 months ago
Alternatives and similar repositories for LLM-Diarize-ASR-Agnostic:
Users that are interested in LLM-Diarize-ASR-Agnostic are comparing it to the libraries listed below
- ☆60Updated last week
- ☆31Updated last year
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆83Updated 3 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆81Updated last year
- Clustering-based methods for overlapping diarization☆81Updated last year
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆52Updated 2 months ago
- Error correction back-end for speaker diarization☆16Updated last year
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆13Updated 2 years ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆27Updated 3 weeks ago
- ☆14Updated 9 months ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆20Updated 2 months ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆38Updated 2 weeks ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆20Updated 2 years ago
- ☆65Updated last year
- ☆26Updated 2 months ago
- Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".☆47Updated last week
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …☆32Updated last year
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆93Updated 7 months ago
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆74Updated 10 months ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆61Updated 8 months ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆13Updated 10 months ago
- ☆31Updated this week
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆75Updated last year
- Streaming Audiotransformers for online Audio tagging☆44Updated 10 months ago
- TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages☆14Updated 11 months ago
- ☆20Updated last month
- SelfRemaster: SSL Speech Restoration☆88Updated last year
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆131Updated last month
- Discriminative Training of VBx Diarization☆23Updated 7 months ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year