ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS
☆33Mar 16, 2023Updated 2 years ago
Alternatives and similar repositories for SSL_for_multitalker
Users that are interested in SSL_for_multitalker are comparing it to the libraries listed below
Sorting:
- ☆30Jun 12, 2025Updated 8 months ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- Materials of public talks given By SJTU X-LANCE members☆14Dec 3, 2022Updated 3 years ago
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆21May 26, 2025Updated 9 months ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Dec 4, 2023Updated 2 years ago
- ☆14Jun 17, 2024Updated last year
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆50Apr 7, 2025Updated 10 months ago
- ☆12Jun 10, 2021Updated 4 years ago
- phone inventory library☆17May 15, 2023Updated 2 years ago
- ☆29Nov 4, 2025Updated 4 months ago
- ☆18Feb 18, 2026Updated 2 weeks ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆30Aug 2, 2025Updated 7 months ago
- A simple package for Guided source separation (GSS)☆133May 20, 2024Updated last year
- ☆17Jul 22, 2024Updated last year
- Implementation of vocoders empowered with pytorch lightning☆18Jan 27, 2024Updated 2 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Jun 18, 2022Updated 3 years ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆38Oct 27, 2025Updated 4 months ago
- 4 Hour cuSignal Tutorial - ICASSP 2021 Notebooks☆49Jun 7, 2021Updated 4 years ago
- ☆37Mar 30, 2021Updated 4 years ago
- A CSRankings-like index for speech researchers☆35Oct 16, 2024Updated last year
- ☆20Sep 2, 2024Updated last year
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Nov 28, 2021Updated 4 years ago
- Script to perform statistical significance test between ASR hypotheses.☆22Aug 13, 2017Updated 8 years ago
- Overlapped Speech detection in Multi-party Conversations☆22Feb 20, 2018Updated 8 years ago
- Official implementation for Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning☆96Nov 20, 2024Updated last year
- ☆23Oct 17, 2024Updated last year
- Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"☆82Nov 7, 2025Updated 3 months ago
- Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context☆214Sep 10, 2024Updated last year
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- ☆30Jul 18, 2024Updated last year
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆27May 30, 2025Updated 9 months ago
- Unofficial PyTorch implementation of "Autoregressive Speech Synthesis without Vector Quantization (MELLE)"☆41Jun 28, 2025Updated 8 months ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Jan 2, 2024Updated 2 years ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 5 months ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated 2 years ago
- ☆10Jul 16, 2024Updated last year