skit-ai / Map-MixLinks
The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at ICASSP-2023)
☆18Updated 2 years ago
Alternatives and similar repositories for Map-Mix
Users that are interested in Map-Mix are comparing it to the libraries listed below
Sorting:
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27Updated 2 years ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Updated 2 years ago
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆43Updated 2 years ago
- Python toolkit for speech processing☆72Updated this week
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆68Updated 5 months ago
- Dataset release for Emotional TTS in Indian Accent☆40Updated 3 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆68Updated 3 years ago
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆51Updated last year
- Balanced Error Rate for Speaker Diarization☆32Updated 2 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Updated last year
- ☆27Updated last year
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆62Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆22Updated 3 weeks ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Updated 4 years ago
- SA-toolkit: Speaker speech anonymization toolkit in python☆28Updated last month
- asr2k☆52Updated last year
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 3 years ago
- Keyword spotting and forced alignment in any language☆76Updated 2 months ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Updated 2 years ago
- ☆65Updated last year
- multilingual speech aligner☆77Updated last year
- Baseline kaldi script for UA-SPEECH corpus☆32Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆76Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆76Updated 4 years ago
- ☆17Updated last year
- ☆38Updated 3 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated 5 months ago