The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at ICASSP-2023)
☆18Feb 17, 2023Updated 3 years ago
Alternatives and similar repositories for Map-Mix
Users that are interested in Map-Mix are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆17Sep 19, 2023Updated 2 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Aug 24, 2023Updated 2 years ago
- Dataset Release for Phone Number Entity capture task☆14Sep 2, 2022Updated 3 years ago
- Dataset Release for Intent Classification from Speech☆48Feb 23, 2025Updated last year
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 3 years ago
- Leveraging BERT to Improve Spoken Language Identification☆18Nov 22, 2022Updated 3 years ago
- ☆18Mar 13, 2024Updated 2 years ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆34Apr 22, 2026Updated 2 months ago
- Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆41Aug 31, 2023Updated 2 years ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆22Jul 19, 2022Updated 3 years ago
- Create flowcharts in elm☆13Apr 19, 2021Updated 5 years ago
- Dataset release for Emotional TTS in Indian Accent☆41Mar 25, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 🗣️ Convert between phonetic alphabets☆11Feb 7, 2022Updated 4 years ago
- A time delay estimation method for event-based time-series data. Time delay estimation is also known as the correction of time offsets an…☆16Dec 3, 2025Updated 6 months ago
- An MRCP server load balancer using OpenSIPS☆19Jun 4, 2020Updated 6 years ago
- SA-toolkit: Speaker speech anonymization toolkit in python☆33Sep 18, 2025Updated 9 months ago
- This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.☆19Dec 20, 2023Updated 2 years ago
- superfast text to speech in any voice☆62Feb 16, 2026Updated 4 months ago
- A hackable Emacs based data-tagging framework☆21Jul 28, 2019Updated 6 years ago
- Vim plugin to fuzzy search tabs opened in all the browser windows and switch.☆19Feb 5, 2020Updated 6 years ago
- Run commands on remote hosts, inspecting key indicators to manage infrastructure☆15Jan 29, 2026Updated 5 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Phonetically balanced text to speech sentences☆10Aug 16, 2021Updated 4 years ago
- Speaker diarization with GMM-UBM and MAP Adaptation☆31Sep 13, 2018Updated 7 years ago
- Official repository of NeXt-TDNN for speaker verification☆83Oct 10, 2024Updated last year
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆14Jun 11, 2024Updated 2 years ago
- Learning Domain-Invariant Transformation for Speaker Verification.☆11Jun 13, 2023Updated 3 years ago
- Language understanding toolkit for human dialogs.☆19Sep 6, 2025Updated 9 months ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated last year
- ☆10Dec 22, 2023Updated 2 years ago
- Job descriptions for Tech roles at Skit☆14Aug 29, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Rainbow Keywords - Official PyTorch Implementation☆14Jun 27, 2024Updated 2 years ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆14Feb 5, 2025Updated last year
- Research code for "Towards multi-task learning of speech and speaker recognition" at https://arxiv.org/pdf/2302.12773.pdf☆12Dec 2, 2024Updated last year
- Spoken Language Identification from Short Utterances☆13Jul 6, 2022Updated 3 years ago
- The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"☆198Sep 24, 2025Updated 9 months ago
- Compute WER and SER for speech recognition evaluation☆26Jun 6, 2026Updated 3 weeks ago
- Official Pytorch implementation of "Large Language Models are Strong Audio-Visual Speech Recognition Learners" [ICASSP 2025] and "Mitigat…☆63Jan 18, 2026Updated 5 months ago