The official implementation of the method discussed in the paper "Improving Spoken Language Identification with Map-Mix" (work accepted at ICASSP-2023)
☆18Feb 17, 2023Updated 3 years ago
Alternatives and similar repositories for Map-Mix
Users who are interested in Map-Mix are comparing it to the repositories listed below
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Aug 24, 2023Updated 2 years ago
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆17Sep 19, 2023Updated 2 years ago
- Dataset Release for Phone Number Entity capture task☆14Sep 2, 2022Updated 3 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- An awesome spoken LID repository. (Work in progress)☆109Apr 22, 2024Updated last year
- Dataset Release for Intent Classification from Speech☆48Feb 23, 2025Updated last year
- Skit's tech website☆11Jul 1, 2024Updated last year
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆16Jun 23, 2024Updated last year
- Vim plugin to get scores and commentary of live cricket matches☆12Jun 13, 2019Updated 6 years ago
- Source code for "Inside Cricket: A fifth umpire's view of your favorite sport"☆12Apr 15, 2018Updated 7 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆15Jun 11, 2024Updated last year
- Create flowcharts in elm☆13Apr 19, 2021Updated 4 years ago
- Leveraging BERT to Improve Spoken Language Identification☆17Nov 22, 2022Updated 3 years ago
- ☆18Mar 13, 2024Updated last year
- An MRCP server load balancer using OpenSIPS☆19Jun 4, 2020Updated 5 years ago
- Dataset release for Emotional TTS in Indian Accent☆40Sep 2, 2022Updated 3 years ago
- Official implementation of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆41Aug 31, 2023Updated 2 years ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆21Jul 19, 2022Updated 3 years ago
- Language understanding toolkit for human dialogs.☆19Sep 6, 2025Updated 5 months ago
- Job descriptions for Tech roles at Skit☆14Aug 29, 2024Updated last year
- Vim plugin to fuzzy search tabs opened in all the browser windows and switch.☆19Feb 5, 2020Updated 6 years ago
- SA-toolkit: Speaker speech anonymization toolkit in python☆30Sep 18, 2025Updated 5 months ago
- This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.☆21Dec 20, 2023Updated 2 years ago
- A hackable Emacs based data-tagging framework☆21Jul 28, 2019Updated 6 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆33Oct 23, 2025Updated 4 months ago
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning☆23Aug 20, 2023Updated 2 years ago
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…☆49May 6, 2024Updated last year
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech☆27May 25, 2023Updated 2 years ago
- 😎 Awesome lists about Speech Emotion Recognition☆101Dec 24, 2024Updated last year
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings☆12Jun 28, 2022Updated 3 years ago
- ☆27Mar 29, 2021Updated 4 years ago
- Speaker diarization with GMM-UBM and MAP Adaptation☆31Sep 13, 2018Updated 7 years ago
- SAMO: SPEAKER ATTRACTOR MULTI-CENTER ONE-CLASS LEARNING FOR VOICE ANTI-SPOOFING☆42Apr 5, 2023Updated 2 years ago
- Official Pytorch implementation of "Large Language Models are Strong Audio-Visual Speech Recognition Learners" [ICASSP 2025] and "Mitigat…☆56Jan 18, 2026Updated last month
- Official repository of NeXt-TDNN for speaker verification☆80Oct 10, 2024Updated last year
- This project predicts wind turbine failure using numerous sensor data by applying classification based ML models that improves prediction…☆11Mar 20, 2023Updated 2 years ago
- We archive data because we are interested in the diffs. All data is from https://video-api.cartoonnetwork.com. We run the check every min…☆10Feb 24, 2026Updated last week
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Apr 13, 2025Updated 10 months ago