CHiME-9 Task 1 - MCoRec baseline
☆26Jan 13, 2026Updated last month
Alternatives and similar repositories for mcorec_baseline
Users that are interested in mcorec_baseline are comparing it to the libraries listed below
Sorting:
- Audio-Visual Speech Recognition☆20Jul 7, 2025Updated 7 months ago
- ☆16Jan 11, 2026Updated last month
- ☆85Jan 28, 2026Updated last month
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆38Oct 27, 2025Updated 4 months ago
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆50May 14, 2025Updated 9 months ago
- Multipurpose Multi Speaker Mixture Signal Generator☆46Feb 6, 2025Updated last year
- ☆46Jul 7, 2025Updated 7 months ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆83Jun 17, 2025Updated 8 months ago
- Discriminative Training of VBx Diarization☆27Sep 23, 2024Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆70Updated this week
- ☆103Updated this week
- Official implementation of USR (NeurIPS 2024)☆39Dec 21, 2024Updated last year
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆79Oct 18, 2022Updated 3 years ago
- Clustering-based methods for overlapping diarization☆82Jan 12, 2024Updated 2 years ago
- ☆92Apr 24, 2025Updated 10 months ago
- ☆39Oct 14, 2022Updated 3 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆93Oct 18, 2023Updated 2 years ago
- ☆11Oct 31, 2024Updated last year
- [INTERSPEECH 2025] Official code for "SEED: Speaker Embedding Enhancement Diffusion Model"☆57Nov 3, 2025Updated 4 months ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Updated this week
- Python package for combining diarization system outputs.☆92Oct 12, 2023Updated 2 years ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆106Jan 10, 2025Updated last year
- This is a Dockerfile to use stable_diffusion.openvino in Docker container.☆13Aug 29, 2022Updated 3 years ago
- ☆10Jul 8, 2025Updated 7 months ago
- Diffusion-based Speech Enhancement: Demonstration of Performance and Generalization☆12Dec 21, 2024Updated last year
- trending repositories and news related to AI☆10Mar 22, 2019Updated 6 years ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- A real-time voice AI system that integrates OpenAI's Realtime API, Llama3 with Twilio Voice to create intelligent voice conversations.☆22Sep 6, 2025Updated 5 months ago
- ☆10May 16, 2024Updated last year
- ☆11Oct 25, 2021Updated 4 years ago
- ☆24Oct 9, 2025Updated 4 months ago
- Hands-on hub to learn techniques to optimize and serve AI models to production the most optimal way.☆14Aug 20, 2025Updated 6 months ago
- ☆10Jan 26, 2021Updated 5 years ago
- 日本音響学会誌用BibTeXスタイルファイル☆11Jan 24, 2022Updated 4 years ago
- FormulaOne: A dataset of algorithmic problems based on MSO formulas.☆24Aug 14, 2025Updated 6 months ago
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆18Aug 8, 2024Updated last year
- This repo covers the different guardrail options available in the market☆25Oct 2, 2025Updated 5 months ago
- Unet based on Wavelet coefficients for segmentation☆11Jan 31, 2020Updated 6 years ago
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.☆25Dec 11, 2025Updated 2 months ago