Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"
☆16May 9, 2021Updated 4 years ago
Alternatives and similar repositories for E2E_ASR_Confidence_Estimation
Users that are interested in E2E_ASR_Confidence_Estimation are comparing it to the libraries listed below
Sorting:
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆21Jul 26, 2021Updated 4 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- ☆10Oct 16, 2025Updated 4 months ago
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.☆11Jan 11, 2020Updated 6 years ago
- ☆23Updated this week
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- ☆12Feb 9, 2021Updated 5 years ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated last year
- A merged version of multiple open-source German speech datasets.☆34May 3, 2024Updated last year
- E2E ASR system☆14Oct 20, 2022Updated 3 years ago
- finetune the chain model based on cvte open source model without traing any GMM for frame alignment☆13Aug 6, 2020Updated 5 years ago
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Feb 16, 2023Updated 3 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- ☆15Aug 25, 2022Updated 3 years ago
- Code for ACL-IJCNLP 2021 paper "N-Best-ASR-Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses."☆17Nov 30, 2021Updated 4 years ago
- This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…☆16Oct 22, 2022Updated 3 years ago
- ☆15Aug 1, 2025Updated 7 months ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 3 years ago
- ☆17Jul 22, 2024Updated last year
- Error correction back-end for speaker diarization☆18Sep 26, 2023Updated 2 years ago
- ☆15Jul 4, 2024Updated last year
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Jun 14, 2021Updated 4 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- MirasVoice is a data set consisting speech samples from bilinguals to train neural network for optimization of speaker verification algor…☆19Mar 15, 2020Updated 5 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202…☆27May 17, 2023Updated 2 years ago
- ☆20Sep 2, 2024Updated last year
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Nov 28, 2021Updated 4 years ago
- ☆50Feb 24, 2026Updated last week
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆23Mar 18, 2024Updated last year
- Decoders from Kaldi using OpenFst☆34Jan 29, 2026Updated last month
- c++ code for merlin tts☆22Oct 19, 2019Updated 6 years ago
- ☆24Sep 20, 2024Updated last year
- Baseline kaldi script for UA-SPEECH corpus☆32Oct 16, 2024Updated last year
- Convert words to numbers☆21Apr 13, 2022Updated 3 years ago
- ☆40Jul 15, 2025Updated 7 months ago
- [USENIX Security 2025] SafeSpeech: Robust and Universal Voice Protection Against Malicious Speech Synthesis☆32May 24, 2025Updated 9 months ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- MSP-Podcast Challenge Baseline Code☆31Jun 12, 2024Updated last year