yuhangear / kaldi-android
☆14Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for kaldi-android
- ☆11Updated 3 years ago
- ☆25Updated last week
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆13Updated last week
- ☆13Updated 3 years ago
- kaldi cnn-tdnnf baseline☆13Updated 3 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆46Updated 4 months ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆15Updated 2 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Updated 2 years ago
- Went online decode demo☆29Updated 3 years ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆10Updated 2 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated last year
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆36Updated 5 months ago
- Addressing Text-dependent Speaker Verification Using Singing Speech☆9Updated 5 years ago
- ☆33Updated 2 years ago
- PyTorch implementation of Continuous Speech Separation☆13Updated 2 years ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆20Updated last month
- A simple command line tool to calculate WER for ASR.☆13Updated 3 weeks ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 3 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆38Updated last year
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆17Updated last month
- 基于单语种语料的中英混合语音识别算法-同花顺算法挑战赛-2021年9-10月双月赛☆14Updated 3 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated last year
- python wrapper for kaldi's native I/O☆27Updated 7 months ago
- ☆16Updated 2 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- ☆13Updated last year
- CDER (Conversational Diarization Error Rate) Scoring Tool☆15Updated 2 years ago
- ☆26Updated last year
- MagicData-RAMC Dataset and Baseline☆49Updated 2 years ago