Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
☆24Feb 25, 2025Updated last year
Alternatives and similar repositories for chime-utils
Users that are interested in chime-utils are comparing it to the libraries listed below
Sorting:
- NeMo: a toolkit for conversational AI☆13May 4, 2024Updated last year
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆83Jun 17, 2025Updated 8 months ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆59Feb 12, 2025Updated last year
- Getting confidences from any end-to-end systems☆11May 24, 2023Updated 2 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- MeetEval - A meeting transcription evaluation toolkit☆143Jan 27, 2026Updated last month
- Discriminative Training of VBx Diarization☆27Sep 23, 2024Updated last year
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆16Jun 16, 2024Updated last year
- ☆16Nov 9, 2023Updated 2 years ago
- ☆28Dec 22, 2021Updated 4 years ago
- A simple package for Guided source separation (GSS)☆133May 20, 2024Updated last year
- ☆103Updated this week
- Training data simulation☆58May 6, 2024Updated last year
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆79Oct 18, 2022Updated 3 years ago
- Clustering-based methods for overlapping diarization☆82Jan 12, 2024Updated 2 years ago
- ☆85Jan 28, 2026Updated last month
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆60Sep 19, 2024Updated last year
- Target speaker automatic speech recognition (TS-ASR)☆12Oct 14, 2023Updated 2 years ago
- ☆13Oct 25, 2024Updated last year
- ☆10Oct 16, 2025Updated 4 months ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆47May 13, 2025Updated 9 months ago
- Sequence to sequence model for Arabic punctuation prediction.☆12Feb 13, 2020Updated 6 years ago
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆13Feb 5, 2025Updated last year
- Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24(Oral)☆13Nov 14, 2024Updated last year
- ☆32Jun 26, 2023Updated 2 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 11 months ago
- ☆13Mar 11, 2025Updated 11 months ago
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆61Updated this week
- ☆35Feb 14, 2025Updated last year
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model.☆31May 22, 2025Updated 9 months ago
- ☆27Jan 19, 2021Updated 5 years ago
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …☆40Mar 13, 2024Updated last year
- Python package for combining diarization system outputs.☆92Oct 12, 2023Updated 2 years ago
- Text Summarization on Spotify Podcast Transcripts for NLP class at @UNIBO☆17Jul 2, 2022Updated 3 years ago
- Python клиент API распознавания и синтеза речи Облака ЦРТ☆11Dec 26, 2022Updated 3 years ago
- Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'☆15Jan 20, 2025Updated last year
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…☆18Jul 23, 2024Updated last year