igorsitdikov / lid_kaldi
☆22Updated 3 years ago
Related projects: ⓘ
- Online streaming speaker change detection model in Pytorch☆34Updated last year
- ☆31Updated 2 weeks ago
- ☆9Updated 3 years ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- ☆27Updated 3 years ago
- An online speech recognition extension toolkit of Kaldi☆57Updated 3 years ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆15Updated 3 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Updated 3 years ago
- ☆16Updated 4 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆60Updated 6 months ago
- This repository creates speaker diarization recipes to be used within the egs folder of kaldi.☆13Updated last month
- Discriminative Training of VBx Diarization☆17Updated 7 months ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated last month
- Clustering-based methods for overlapping diarization☆68Updated 8 months ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆47Updated 2 months ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 2 years ago
- ☆17Updated last year
- Articulatory features estimation using Listen Attend and Spell architecture.☆32Updated 4 years ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆24Updated 2 months ago
- ☆75Updated 2 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆34Updated 4 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆26Updated last month
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆34Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 3 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆19Updated this week
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Updated 2 years ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆27Updated 2 years ago
- Simple Python package for fast DER computation☆31Updated last year
- Python package for combining diarization system outputs.☆73Updated 11 months ago
- ☆48Updated 11 months ago