placebokkk / ctc-asrView external linksLinks
pytorch CTC implementation for ASR. Use eesen's fst decoder framework
☆10Feb 27, 2020Updated 5 years ago
Alternatives and similar repositories for ctc-asr
Users that are interested in ctc-asr are comparing it to the libraries listed below
Sorting:
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train…☆14Feb 15, 2021Updated 5 years ago
- deep-learning based audio-visual lip bometrics☆15May 9, 2023Updated 2 years ago
- ☆15May 8, 2021Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- ☆18Aug 9, 2018Updated 7 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆20Sep 5, 2023Updated 2 years ago
- Tensorflow 2 Speech Recognition Code (Transformer)☆25Jun 29, 2020Updated 5 years ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆26Apr 27, 2022Updated 3 years ago
- ☆24Sep 25, 2018Updated 7 years ago
- Mining effective negative training samples for keyword spotting (PyTorch)☆65May 23, 2020Updated 5 years ago
- Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure☆88Feb 23, 2018Updated 7 years ago
- TTS Text Analyzer☆32Jul 20, 2023Updated 2 years ago
- ☆61Jan 31, 2023Updated 3 years ago
- Framework for Detection Evaluation (F4DE) : set of evaluation tools for detection evaluations and for specific NIST-coordinated evaluatio…☆25Jul 6, 2017Updated 8 years ago
- Pronunciation-assisted Subword Modeling☆31May 30, 2019Updated 6 years ago
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Oct 23, 2023Updated 2 years ago
- ☆32Nov 24, 2024Updated last year
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Jan 15, 2020Updated 6 years ago
- An ambient noise detector☆10Aug 23, 2020Updated 5 years ago
- ☆11Apr 20, 2020Updated 5 years ago
- The open source code for SimpleSpeech series☆145Oct 8, 2024Updated last year
- RepVgg + HiFiGAN☆36Aug 10, 2022Updated 3 years ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆139Jun 7, 2021Updated 4 years ago
- Python wrapper for kaldi's arpa2fst☆38Aug 27, 2025Updated 5 months ago
- Learning Domain-Invariant Transformation for Speaker Verification.☆11Jun 13, 2023Updated 2 years ago
- End-to-end Speech Translation☆35Apr 12, 2021Updated 4 years ago
- Rich Prosody Diversity Modelling with Phone-level Mixture Density Network☆45Dec 1, 2021Updated 4 years ago
- Multilingual speech translation☆41Apr 15, 2021Updated 4 years ago
- ☆45Apr 5, 2019Updated 6 years ago
- E2E system with LF-MMI; word N-gram for Mandarin☆166Apr 29, 2022Updated 3 years ago
- ☆41Jun 25, 2018Updated 7 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- halcon算子阈值分割的实现☆12Apr 13, 2018Updated 7 years ago
- Simple Telegram bot to annotate and varify automatic speech recognition datasets☆12Mar 30, 2021Updated 4 years ago
- Code base for WaveTransformer: A novel architecture for automated audio captioning☆44Mar 1, 2021Updated 4 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆13Apr 5, 2020Updated 5 years ago