Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"
☆16May 9, 2021Updated 4 years ago
Alternatives and similar repositories for E2E_ASR_Confidence_Estimation
Users that are interested in E2E_ASR_Confidence_Estimation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated 2 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆22Jul 26, 2021Updated 4 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- A merged version of multiple open-source German speech datasets.☆34May 3, 2024Updated 2 years ago
- ☆12Jun 10, 2021Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆12Feb 9, 2021Updated 5 years ago
- E2E ASR system☆14Oct 20, 2022Updated 3 years ago
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Feb 16, 2023Updated 3 years ago
- This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…☆16Oct 22, 2022Updated 3 years ago
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.☆11Jan 11, 2020Updated 6 years ago
- ☆12Mar 23, 2026Updated last month
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- ☆16Aug 1, 2025Updated 9 months ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆23Mar 18, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Chinese Prosodic Structure Prediction☆10May 18, 2019Updated 6 years ago
- ☆12May 18, 2022Updated 3 years ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 4 years ago
- ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别☆12Oct 25, 2020Updated 5 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- Using SepFormer☆10Feb 2, 2023Updated 3 years ago
- c++ code for merlin tts☆22Oct 19, 2019Updated 6 years ago
- 一个深信服EasyConnect的自动控制程序☆13Nov 2, 2020Updated 5 years ago
- Code for ACL-IJCNLP 2021 paper "N-Best-ASR-Transformer: Enhancing SLU Performance using Multiple ASR Hypotheses."☆17Nov 30, 2021Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- Character-level Recurrent Neural Network Language Model (rnnlm) implement in Pytorch.☆12Oct 4, 2020Updated 5 years ago
- PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper☆12Mar 4, 2022Updated 4 years ago
- image-segmentation and text-localization☆12Aug 22, 2018Updated 7 years ago
- ☆15Jul 4, 2024Updated last year
- ☆18Jul 22, 2024Updated last year
- ☆15Aug 25, 2022Updated 3 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- finetune the chain model based on cvte open source model without traing any GMM for frame alignment☆13Aug 6, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆24Mar 11, 2026Updated last month
- 在云端存储学习的过程产生的代码☆11Nov 30, 2020Updated 5 years ago
- ☆15Aug 30, 2022Updated 3 years ago
- Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)☆16May 8, 2022Updated 3 years ago
- ☆12Aug 9, 2021Updated 4 years ago
- Decoders from Kaldi using OpenFst☆34Apr 10, 2026Updated 3 weeks ago
- Error correction back-end for speaker diarization☆18Sep 26, 2023Updated 2 years ago