Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Pooling Loss.
☆24Oct 11, 2024Updated last year
Alternatives and similar repositories for CE-OptimizedLoss
Users that are interested in CE-OptimizedLoss are comparing it to the libraries listed below
Sorting:
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- End-to-End Speech Processing Toolkit☆15Jan 20, 2025Updated last year
- A ctc decoder for both online and offline asr model☆66Nov 18, 2023Updated 2 years ago
- Open Source WFST-based Decoder Toolkit☆77Feb 11, 2016Updated 10 years ago
- it's ASR decoder and make graph project☆33May 26, 2022Updated 3 years ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- ☆15Jul 4, 2024Updated last year
- NAR-BERT-ASR☆10Sep 27, 2021Updated 4 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- An upgrade framework for train and validate compare with icefall using Lightning.☆15Mar 26, 2025Updated 11 months ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- ☆21Jun 1, 2021Updated 4 years ago
- ☆15Aug 25, 2022Updated 3 years ago
- [ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…☆80Jan 9, 2025Updated last year
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆14Sep 4, 2019Updated 6 years ago
- Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.☆91May 25, 2023Updated 2 years ago
- Conversion of recurrent neural network language models to weighted finite state transducers☆58Jun 1, 2018Updated 7 years ago
- Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks https://arxiv.org/abs/19…☆14Apr 16, 2020Updated 5 years ago
- ☆37Jun 28, 2021Updated 4 years ago
- **ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…☆24Sep 27, 2022Updated 3 years ago
- ☆32Oct 28, 2022Updated 3 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆91Feb 18, 2025Updated last year
- E2E system with LF-MMI; word N-gram for Mandarin☆167Apr 29, 2022Updated 3 years ago
- Memory efficient transducer loss computation☆70Jun 10, 2022Updated 3 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- Wav2vec2 Large XLSR 53 fine-tuned for Malayalam☆11Sep 7, 2021Updated 4 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆149Aug 25, 2023Updated 2 years ago
- The official implementation of the DIFFA series for dLLM-based large audio language model☆67Mar 12, 2026Updated last week
- Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…☆1,807Feb 25, 2026Updated 3 weeks ago
- ☆11Apr 20, 2020Updated 5 years ago
- We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction☆188Mar 9, 2026Updated last week
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Voice Conversion pipeline consisting of GE2E speaker encoder, AutoVC conversion model and MelGAN vocoder.☆23Jan 24, 2021Updated 5 years ago
- A CRF-based ASR Toolkit☆366Feb 5, 2026Updated last month
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Mar 19, 2024Updated 2 years ago
- ASR教程: https://dataxujing.github.io/ASR-paper/☆25Jul 1, 2024Updated last year
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated 2 months ago
- Neural network-based forced alignment with bidirectional attention mechanism☆78Jan 17, 2025Updated last year
- This repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).☆283May 23, 2022Updated 3 years ago