SLT 2024 Challenge: Post-ASR-Speaker-Tagging
☆16Jun 16, 2024Updated last year
Alternatives and similar repositories for llm_speaker_tagging
Users that are interested in llm_speaker_tagging are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- NeMo: a toolkit for conversational AI☆13May 4, 2024Updated last year
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Feb 25, 2025Updated last year
- open-source Mandarian biased word dataset☆14Sep 21, 2023Updated 2 years ago
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆33Jun 26, 2023Updated 2 years ago
- Python package for combining diarization system outputs.☆94Oct 12, 2023Updated 2 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- ☆37Mar 30, 2021Updated 5 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆79Oct 18, 2022Updated 3 years ago
- Training data simulation☆59May 6, 2024Updated last year
- [ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…☆79Jan 9, 2025Updated last year
- ☆15Jul 4, 2024Updated last year
- ☆17Jul 22, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆17May 5, 2024Updated last year
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆60Feb 12, 2025Updated last year
- Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.☆120Mar 18, 2023Updated 3 years ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆62Sep 19, 2024Updated last year
- A simple package for Guided source separation (GSS)☆133May 20, 2024Updated last year
- MeetEval - A meeting transcription evaluation toolkit☆151Jan 27, 2026Updated 2 months ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆85Jun 17, 2025Updated 9 months ago
- official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification☆14Mar 14, 2025Updated last year
- ☆95Apr 24, 2025Updated 11 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Gemma-based Multilingual Machine Translation Models☆40Feb 13, 2026Updated 2 months ago
- ☆61Apr 2, 2026Updated 2 weeks ago
- Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"☆18Jun 21, 2023Updated 2 years ago
- ☆25Jan 2, 2024Updated 2 years ago
- ☆59Mar 28, 2025Updated last year
- AudioVisual Diarization - Supervised and Unsupervised☆15Nov 22, 2022Updated 3 years ago
- MagicData-RAMC Dataset and Baseline☆59Sep 13, 2022Updated 3 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆39Jan 6, 2024Updated 2 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆15Dec 22, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆30Jun 12, 2025Updated 10 months ago
- ☆15Sep 13, 2022Updated 3 years ago
- Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge☆10Aug 8, 2023Updated 2 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- ☆88Jul 31, 2025Updated 8 months ago
- Submission to MediaEval 2021 Emotions and Themes in Music challenge. Noisy-student training for music emotion tagging☆11Dec 2, 2021Updated 4 years ago
- [ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech☆25Apr 20, 2022Updated 3 years ago