etri / kmsavLinks
☆11Updated 8 months ago
Alternatives and similar repositories for kmsav
Users that are interested in kmsav are comparing it to the libraries listed below
Sorting:
- Simple tool for speech dataset augmentation for modeling various prosodies.☆14Updated 4 years ago
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12Updated last year
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆17Updated 8 months ago
- acnn for text-independent speaker recognition☆10Updated 3 years ago
- Whisper Speech Quality Assessment (WhiSQA)☆9Updated 6 months ago
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…☆14Updated 11 months ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- [INTERSPEECH 2025] Official code for "SEED: Speaker Embedding Enhancement Diffusion"☆13Updated last month
- Code for the paper "JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis"☆12Updated 7 months ago
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆20Updated last year
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…☆9Updated 3 years ago
- ☆10Updated 7 months ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆17Updated 11 months ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Updated 3 months ago
- ☆10Updated last year
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Updated 2 weeks ago