IS2AI / Uzbek_ASR
☆11Updated 3 years ago
Alternatives and similar repositories for Uzbek_ASR:
Users that are interested in Uzbek_ASR are comparing it to the libraries listed below
- ☆12Updated 2 years ago
- ☆35Updated last month
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16Updated 3 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆18Updated 3 years ago
- Getting confidences from any end-to-end systems☆11Updated last year
- ☆21Updated this week
- ☆12Updated 2 months ago
- ☆13Updated 3 years ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆20Updated 2 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆22Updated 2 months ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆20Updated 5 months ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 9 months ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 3 years ago
- ☆13Updated 2 years ago
- ☆16Updated 5 years ago
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆21Updated 7 months ago
- ☆22Updated 3 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆17Updated 2 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Updated 4 years ago
- A simple command line tool to calculate WER for ASR.☆14Updated 6 months ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆42Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- ☆21Updated 5 years ago
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for SpeakerDiarization Using Normalized Maximum Eigengap"☆13Updated 5 years ago
- open-source Mandarian biased word dataset☆11Updated last year
- Word Error Rate Estimation☆13Updated 4 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- A merged version of multiple open-source German speech datasets.☆31Updated 11 months ago