tango4j / llm_speaker_taggingView external linksLinks
SLT 2024 Challenge: Post-ASR-Speaker-Tagging
☆16Jun 16, 2024Updated last year
Alternatives and similar repositories for llm_speaker_tagging
Users that are interested in llm_speaker_tagging are comparing it to the libraries listed below
Sorting:
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- NeMo: a toolkit for conversational AI☆13May 4, 2024Updated last year
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Feb 25, 2025Updated 11 months ago
- open-source Mandarian biased word dataset☆14Sep 21, 2023Updated 2 years ago
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- ☆32Jun 26, 2023Updated 2 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆18Jul 16, 2024Updated last year
- ☆17May 5, 2024Updated last year
- ☆37Mar 30, 2021Updated 4 years ago
- ☆15Jul 4, 2024Updated last year
- ☆17Jul 22, 2024Updated last year
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆79Oct 18, 2022Updated 3 years ago
- Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.☆119Mar 18, 2023Updated 2 years ago
- [ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…☆80Jan 9, 2025Updated last year
- Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"☆19Jun 21, 2023Updated 2 years ago
- ☆50Jan 28, 2026Updated 2 weeks ago
- Python package for combining diarization system outputs.☆92Oct 12, 2023Updated 2 years ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆58Feb 12, 2025Updated last year
- Training data simulation☆58May 6, 2024Updated last year
- ☆25Jan 2, 2024Updated 2 years ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆60Sep 19, 2024Updated last year
- Gemma-based Multilingual Machine Translation Models☆33Feb 5, 2026Updated last week
- MeetEval - A meeting transcription evaluation toolkit☆141Jan 27, 2026Updated 2 weeks ago
- [ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech☆25Apr 20, 2022Updated 3 years ago
- A simple package for Guided source separation (GSS)☆132May 20, 2024Updated last year
- ☆30Jun 12, 2025Updated 8 months ago
- Non-parallel voice conversion called ICRCycleGAN-VC based on CycleGAN and Inception-resNet module by Afiuny☆15Oct 30, 2025Updated 3 months ago
- Detecting and correction dysfluencies/stuttering/stammering in audio files☆10Apr 23, 2023Updated 2 years ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆83Jun 17, 2025Updated 7 months ago
- ☆91Apr 24, 2025Updated 9 months ago
- ☆86Jul 31, 2025Updated 6 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆93Oct 18, 2023Updated 2 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Jan 6, 2024Updated 2 years ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- “Welcome to my GitHub repository, a hub of exploration and innovation in the realm of data science. 📊💻 Here, you’ll find a curated coll…☆10Apr 3, 2025Updated 10 months ago
- Learning Domain-Invariant Transformation for Speaker Verification.☆11Jun 13, 2023Updated 2 years ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆18May 8, 2025Updated 9 months ago
- ☆10Oct 20, 2022Updated 3 years ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆50Apr 7, 2025Updated 10 months ago