SLT 2024 Challenge: Post-ASR-Speaker-Tagging
☆16Jun 16, 2024Updated last year
Alternatives and similar repositories for llm_speaker_tagging
Users that are interested in llm_speaker_tagging are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- NeMo: a toolkit for conversational AI☆13May 4, 2024Updated last year
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Feb 25, 2025Updated last year
- open-source Mandarian biased word dataset☆14Sep 21, 2023Updated 2 years ago
- ☆32Jun 26, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- Python package for combining diarization system outputs.☆93Oct 12, 2023Updated 2 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- ☆37Mar 30, 2021Updated 4 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆79Oct 18, 2022Updated 3 years ago
- Training data simulation☆59May 6, 2024Updated last year
- [ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…☆79Jan 9, 2025Updated last year
- ☆15Jul 4, 2024Updated last year
- ☆17Jul 22, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆17May 5, 2024Updated last year
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆60Feb 12, 2025Updated last year
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆61Sep 19, 2024Updated last year
- Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.☆120Mar 18, 2023Updated 3 years ago
- A simple package for Guided source separation (GSS)☆133May 20, 2024Updated last year
- MeetEval - A meeting transcription evaluation toolkit☆149Jan 27, 2026Updated 2 months ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆85Jun 17, 2025Updated 9 months ago
- official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification☆14Mar 14, 2025Updated last year
- ☆94Apr 24, 2025Updated 11 months ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- ☆60Updated this week
- Gemma-based Multilingual Machine Translation Models☆38Feb 13, 2026Updated last month
- Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"☆18Jun 21, 2023Updated 2 years ago
- ☆25Jan 2, 2024Updated 2 years ago
- ☆59Mar 28, 2025Updated 11 months ago
- AudioVisual Diarization - Supervised and Unsupervised☆15Nov 22, 2022Updated 3 years ago
- MagicData-RAMC Dataset and Baseline☆58Sep 13, 2022Updated 3 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆39Jan 6, 2024Updated 2 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆15Dec 22, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆30Jun 12, 2025Updated 9 months ago
- ☆15Sep 13, 2022Updated 3 years ago
- Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge☆10Aug 8, 2023Updated 2 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- ☆86Jul 31, 2025Updated 7 months ago
- Submission to MediaEval 2021 Emotions and Themes in Music challenge. Noisy-student training for music emotion tagging☆11Dec 2, 2021Updated 4 years ago
- [ICASSP 2022] AISHELL-NER: Named Entity Recognition from Chinese Speech☆25Apr 20, 2022Updated 3 years ago