Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"
☆16May 9, 2021Updated 5 years ago
Alternatives and similar repositories for E2E_ASR_Confidence_Estimation
Users that are interested in E2E_ASR_Confidence_Estimation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated 2 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆22Jul 26, 2021Updated 4 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- A merged version of multiple open-source German speech datasets.☆34May 3, 2024Updated 2 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆12Feb 9, 2021Updated 5 years ago
- E2E ASR system☆14Oct 20, 2022Updated 3 years ago
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Feb 16, 2023Updated 3 years ago
- This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…☆16Oct 22, 2022Updated 3 years ago
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.☆11Jan 11, 2020Updated 6 years ago
- ☆12Mar 23, 2026Updated 2 months ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- ☆16Aug 1, 2025Updated 9 months ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆23Mar 18, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Chinese Prosodic Structure Prediction☆10May 18, 2019Updated 7 years ago
- ☆12May 18, 2022Updated 4 years ago
- ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别☆12Oct 25, 2020Updated 5 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- Using SepFormer☆10Feb 2, 2023Updated 3 years ago
- c++ code for merlin tts☆22Oct 19, 2019Updated 6 years ago
- 一个深信服EasyConnect的自动控制程序☆13Nov 2, 2020Updated 5 years ago
- Code for ACL-IJCNLP 2021 paper "N-Best-ASR-Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses."☆17Nov 30, 2021Updated 4 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Character-level Recurrent Neural Network Language Model (rnnlm) implement in Pytorch.☆12Oct 4, 2020Updated 5 years ago
- PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper☆12Mar 4, 2022Updated 4 years ago
- image-segmentation and text-localization☆12Aug 22, 2018Updated 7 years ago
- ☆15Jul 4, 2024Updated last year
- ☆18Jul 22, 2024Updated last year
- ☆15Aug 25, 2022Updated 3 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- finetune the chain model based on cvte open source model without traing any GMM for frame alignment☆13Aug 6, 2020Updated 5 years ago
- 在云端存储学习的过程产生的代码☆11Nov 30, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆15Aug 30, 2022Updated 3 years ago
- Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)☆16May 8, 2022Updated 4 years ago
- ☆12Aug 9, 2021Updated 4 years ago
- Decoders from Kaldi using OpenFst☆36Apr 10, 2026Updated last month
- Error correction back-end for speaker diarization☆18Sep 26, 2023Updated 2 years ago
- ☆25Mar 11, 2026Updated 2 months ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Jun 14, 2021Updated 4 years ago