PyTorch CTC implementation for ASR. Uses Eesen's FST decoder framework
☆10Feb 27, 2020Updated 6 years ago
Alternatives and similar repositories for ctc-asr
Users that are interested in ctc-asr are comparing it to the libraries listed below
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train…☆14Feb 15, 2021Updated 5 years ago
- Deep-learning based audio-visual lip biometrics☆15May 9, 2023Updated 2 years ago
- ☆15May 8, 2021Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain for accurate and fast decoding.☆16Jun 17, 2022Updated 3 years ago
- ☆18Aug 9, 2018Updated 7 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆20Sep 5, 2023Updated 2 years ago
- Tensorflow 2 Speech Recognition Code (Transformer)☆25Jun 29, 2020Updated 5 years ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆26Apr 27, 2022Updated 3 years ago
- ☆24Sep 25, 2018Updated 7 years ago
- Mining effective negative training samples for keyword spotting (PyTorch)☆65May 23, 2020Updated 5 years ago
- Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure☆88Feb 23, 2018Updated 8 years ago
- TTS Text Analyzer☆31Jul 20, 2023Updated 2 years ago
- ☆61Jan 31, 2023Updated 3 years ago
- Framework for Detection Evaluation (F4DE) : set of evaluation tools for detection evaluations and for specific NIST-coordinated evaluatio…☆25Jul 6, 2017Updated 8 years ago
- Pronunciation-assisted Subword Modeling☆31May 30, 2019Updated 6 years ago
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Oct 23, 2023Updated 2 years ago
- ☆32Nov 24, 2024Updated last year
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Jan 15, 2020Updated 6 years ago
- An ambient noise detector☆10Aug 23, 2020Updated 5 years ago
- ☆11Apr 20, 2020Updated 5 years ago
- The open source code for SimpleSpeech series☆145Oct 8, 2024Updated last year
- RepVgg + HiFiGAN☆36Aug 10, 2022Updated 3 years ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆139Jun 7, 2021Updated 4 years ago
- Python wrapper for kaldi's arpa2fst☆38Aug 27, 2025Updated 6 months ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆17Nov 19, 2025Updated 3 months ago
- Learning Domain-Invariant Transformation for Speaker Verification.☆11Jun 13, 2023Updated 2 years ago
- End-to-end Speech Translation☆35Apr 12, 2021Updated 4 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Dec 1, 2021Updated 4 years ago
- Multilingual speech translation☆41Apr 15, 2021Updated 4 years ago
- ☆45Apr 5, 2019Updated 6 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Simple Telegram bot to annotate and verify automatic speech recognition datasets☆12Mar 30, 2021Updated 4 years ago
- ☆11Aug 28, 2017Updated 8 years ago
- Repository for my studies of Causal Inference☆10Dec 1, 2019Updated 6 years ago
- This repo provides a MATLAB implementation of the Statistical Approach to Texture Classification from Single Images paper by Varm…☆12Jan 30, 2018Updated 8 years ago
- This project is a PyTorch implementation of the paper "ECViT: Efficient Convolutional Vision Transformer with Local-Attention and Multi-s…☆19Jun 12, 2025Updated 8 months ago
- AR-VirtualGlassesTryOn☆13May 21, 2016Updated 9 years ago