SLT 2024 Challenge: Post-ASR-Speaker-Tagging
☆16Jun 16, 2024Updated 2 years ago
Alternatives and similar repositories for llm_speaker_tagging
Users that are interested in llm_speaker_tagging are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- NeMo: a toolkit for conversational AI☆13May 4, 2024Updated 2 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆26Feb 25, 2025Updated last year
- open-source Mandarian biased word dataset☆14Sep 21, 2023Updated 2 years ago
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆33Jun 26, 2023Updated 2 years ago
- Python package for combining diarization system outputs.☆94Oct 12, 2023Updated 2 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- ☆38Mar 30, 2021Updated 5 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆80Oct 18, 2022Updated 3 years ago
- Training data simulation☆60May 6, 2024Updated 2 years ago
- [ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…☆78Jan 9, 2025Updated last year
- ☆15Jul 4, 2024Updated last year
- ☆18Jul 22, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆17May 5, 2024Updated 2 years ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆61Feb 12, 2025Updated last year
- Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.☆120Mar 18, 2023Updated 3 years ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆62Sep 19, 2024Updated last year
- A simple package for Guided source separation (GSS)☆134May 20, 2024Updated 2 years ago
- MeetEval - A meeting transcription evaluation toolkit☆162Jan 27, 2026Updated 4 months ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆87Jun 17, 2025Updated 11 months ago
- official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification☆14Mar 14, 2025Updated last year
- ☆95Apr 24, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆66Apr 2, 2026Updated 2 months ago
- Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"☆18Jun 21, 2023Updated 2 years ago
- Gemma-based Multilingual Machine Translation Models☆48Feb 13, 2026Updated 4 months ago
- ☆25Jan 2, 2024Updated 2 years ago
- ☆59Mar 28, 2025Updated last year
- MagicData-RAMC Dataset and Baseline☆64Sep 13, 2022Updated 3 years ago
- AudioVisual Diarization - Supervised and Unsupervised☆15Nov 22, 2022Updated 3 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆41Jan 6, 2024Updated 2 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆15Dec 22, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆32Jun 12, 2025Updated last year
- ☆15Sep 13, 2022Updated 3 years ago
- Baseline system for Language-based Audio Retrieval (Task 6B) in DCASE 2023 Challenge☆10Aug 8, 2023Updated 2 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- Submission to MediaEval 2021 Emotions and Themes in Music challenge. Noisy-student training for music emotion tagging☆11Dec 2, 2021Updated 4 years ago
- ☆88Jul 31, 2025Updated 10 months ago
- Pytorch implementation of 'Improving Self-supervised Lightweight Model Learning via Hard-aware Metric Distillation. In ECCV 2022'☆12Mar 22, 2023Updated 3 years ago