ivankunyankin / quartznet-asr
☆19Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for quartznet-asr
- Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTroch with CTC loss and beam search.☆16Updated 4 years ago
- Example code for a neural transducer model.☆60Updated 9 months ago
- ☆32Updated 2 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 9 months ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆71Updated 3 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆42Updated 3 years ago
- Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.☆65Updated last week
- asr2k☆48Updated 5 months ago
- Repository containing the open source code of works published at the FBK MT unit.☆42Updated 4 months ago
- ☆56Updated last year
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆81Updated last week
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Updated last year
- PyTorch end-to-end speech recognition☆49Updated 3 years ago
- Python wrapper for kaldi's arpa2fst☆38Updated last year
- Prosodic Speech Segmentation with Transformers☆23Updated 8 months ago
- Grapheme to phoneme model for PyTorch☆40Updated 2 years ago
- Word Error Rate Estimation☆10Updated 4 years ago
- CTC Decoder implementation with python only. Also supports language model decoding using KenLM.☆36Updated 6 months ago
- End-to-end diarization loss☆22Updated 3 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆39Updated 3 months ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆61Updated 8 months ago
- ☆16Updated 2 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆57Updated last year
- ☆56Updated last year
- ☆10Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- Multistream CNN for Robust Acoustic Modeling☆39Updated 3 years ago
- PyTorch implementation of RNN-Transducer(RNN-T).☆72Updated 3 years ago