skit-ai/Map-Mix

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/skit-ai/Map-Mix)

skit-ai / Map-Mix

The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at ICASSP-2023)

☆18

Alternatives and similar repositories for Map-Mix

Users that are interested in Map-Mix are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

GATECH-EIC / S3-Router
View on GitHub
[NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…
☆17Sep 19, 2023Updated 2 years ago
skit-ai / speech-to-intent-dataset
View on GitHub
Dataset Release for Intent Classification from Speech
☆48Feb 23, 2025Updated last year
Lhx94As / Awesome-Spoken-Language-Identification
View on GitHub
An awesome spoken LID repository. (Working in progress
☆109Apr 22, 2024Updated 2 years ago
noiseux1523 / NIST-SRE-2019
View on GitHub
Score Normalization for NIST 2019 Speaker Recognition Evaluation
☆10Nov 8, 2019Updated 6 years ago
skit-ai / slu-prosody
View on GitHub
Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…
☆27May 17, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
THUsatlab / BERT-LID
View on GitHub
Leveraging BERT to Improve Spoken Language Identification
☆17Nov 22, 2022Updated 3 years ago
tuanct1997 / Federated-Learning-ASR-based-on-wav2vec-2.0
View on GitHub
☆18Mar 13, 2024Updated 2 years ago
YChenL / DS-TDNN
View on GitHub
Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch
☆41Aug 31, 2023Updated 2 years ago
khanld / Dynamic-Mixing
View on GitHub
Dynamic Mixing For Speech Processing (mix-on-the-fly)
☆22Jul 19, 2022Updated 4 years ago
skit-ai / tech
View on GitHub
Skit's tech website
☆11Jul 1, 2024Updated 2 years ago
pravj / inside-cricket
View on GitHub
Source code for "Inside Cricket: A fifth umpire' view of your favorite sport"
☆12Apr 15, 2018Updated 8 years ago
skit-ai / emotion-tts-dataset
View on GitHub
Dataset release for Emotional TTS in Indian Accent
☆41Mar 25, 2026Updated 3 months ago
deep-privacy / SA-toolkit
View on GitHub
SA-toolkit: Speaker speech anonymization toolkit in python
☆33Sep 18, 2025Updated 10 months ago
skit-ai / mrcp-load-balancer
View on GitHub
An MRCP server load balancer using OpenSIPS
☆19Jun 4, 2020Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
yogeshbalaji / Normalized-Wasserstein
View on GitHub
Normalized Wasserstein for Mixture Distributions
☆11Mar 24, 2023Updated 3 years ago
koudounasalkis / Audio-Speech-Tutorial
View on GitHub
This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.
☆19Dec 20, 2023Updated 2 years ago
skit-ai / tog
View on GitHub
A hackable Emacs based data-tagging framework
☆21Jul 28, 2019Updated 6 years ago
fedderrico / ubm_map_diarization
View on GitHub
Speaker diarization with GMM-UBM and MAP Adaptation
☆31Sep 13, 2018Updated 7 years ago
jagabandhumishra / W2V-E2E-Language-Diarization
View on GitHub
☆11Sep 4, 2023Updated 2 years ago
vipul-sharma20 / vim-browser-tabs
View on GitHub
Vim plugin to fuzzy search tabs opened in all the browser windows and switch.
☆19Feb 5, 2020Updated 6 years ago
rhasspy / tts-prompts
View on GitHub
Phonetically balanced text to speech sentences
☆10Aug 16, 2021Updated 4 years ago
microsoft / NoAudioCaptioning
View on GitHub
Repository for "Training Audio Captioning Models without Audio"
☆10Sep 26, 2023Updated 2 years ago
go-bridget / inspector
View on GitHub
Run commands on remote hosts, inspecting key indicators to manage infrastructure
☆15Jan 29, 2026Updated 5 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
koudounasalkis / AI4Voice
View on GitHub
This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024
☆15Jun 11, 2024Updated 2 years ago
SSTC-Challenge / SSTC2024_baseline_system
View on GitHub
☆12Jun 14, 2024Updated 2 years ago
zhepeiw / cssl_sound
View on GitHub
☆14Jan 17, 2023Updated 3 years ago
backspacetg / distilXLSR
View on GitHub
Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
☆13Mar 30, 2025Updated last year
MiukkaZh / MGT
View on GitHub
Learning Domain-Invariant Transformation for Speaker Verification.
☆11Jun 13, 2023Updated 3 years ago
skit-ai / dialogy
View on GitHub
Language understanding toolkit for human dialogs.
☆19Sep 6, 2025Updated 10 months ago
kosuke-kitahara / xlsr-wav2vec2-phoneme-recognition
View on GitHub
☆27Mar 29, 2021Updated 5 years ago
IDRnD / redimnet
View on GitHub
The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"
☆205Jul 9, 2026Updated 2 weeks ago
swagshaw / Rainbow-Keywords
View on GitHub
Rainbow Keywords - Official PyTorch Implementation
☆14Jun 27, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
skit-ai / job-descriptions
View on GitHub
Job descriptions for Tech roles at Skit
☆14Aug 29, 2024Updated last year
zafarrafii / Zaf-Python
View on GitHub
Zafar's Audio Functions in Python for audio signal analysis: STFT, inverse STFT, mel filterbank, mel spectrogram, MFCC, CQT kernel, CQT s…
☆59Aug 8, 2025Updated 11 months ago
fgnt / speaker_reassignment
View on GitHub
Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment
☆14Feb 5, 2025Updated last year
nikvaessen / disjoint-mtl
View on GitHub
Research code for "Towards multi-task learning of speech and speaker recognition" at https://arxiv.org/pdf/2302.12773.pdf
☆12Dec 2, 2024Updated last year
r10a / music-speech-classifier
View on GitHub
Aim to implement a classifier which classifies an audio sample into speech or music.
☆10Sep 17, 2019Updated 6 years ago
vTAD2025-Challenge / vTAD
View on GitHub
☆17Oct 24, 2025Updated 9 months ago
ayh2bxa / realtime_nkf_aec
View on GitHub
☆18Dec 27, 2023Updated 2 years ago