pytorch CTC implementation for ASR. Use eesen's fst decoder framework
☆10Feb 27, 2020Updated 6 years ago
Alternatives and similar repositories for ctc-asr
Users that are interested in ctc-asr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 4 years ago
- ☆11Apr 20, 2020Updated 5 years ago
- Reusable code for Python so I don't have to write the same thing twice!☆12Feb 1, 2019Updated 7 years ago
- APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train…☆14Feb 15, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆15May 8, 2021Updated 4 years ago
- ☆18Aug 9, 2018Updated 7 years ago
- ☆61Jan 31, 2023Updated 3 years ago
- ☆16May 25, 2019Updated 6 years ago
- Mining effective negative training samples for keyword spotting (PyTorch)☆64May 23, 2020Updated 5 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- Scripts for exporting Kaldi labeled data into TensorFlow☆12Jul 31, 2019Updated 6 years ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆26Apr 27, 2022Updated 3 years ago
- Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure☆88Feb 23, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- deep-learning based audio-visual lip bometrics☆15May 9, 2023Updated 2 years ago
- The open source code for SimpleSpeech series☆145Oct 8, 2024Updated last year
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆20Sep 5, 2023Updated 2 years ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆114Dec 2, 2025Updated 3 months ago
- ☆42Jun 25, 2018Updated 7 years ago
- Python wrapper for kaldi's arpa2fst☆38Aug 27, 2025Updated 7 months ago
- TTS Text Analyzer☆31Jul 20, 2023Updated 2 years ago
- Simple, standalone python classes for training statistical language models using several popular smoothing methods.☆25Nov 3, 2012Updated 13 years ago
- Discogs-VI dataset and code☆20Dec 13, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- CBbot is a new product form within the CodeBanana ecosystem, positioned as a local-first intelligent agent with full-spectrum operational…☆61Feb 16, 2026Updated last month
- Framework for Detection Evaluation (F4DE) : set of evaluation tools for detection evaluations and for specific NIST-coordinated evaluatio…☆25Jul 6, 2017Updated 8 years ago
- 60k hours of phoneme-aligned audio from audio books☆19Jul 27, 2024Updated last year
- ☆24Sep 25, 2018Updated 7 years ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 4 years ago
- Tensorflow 2 Speech Recognition Code (Transformer)☆25Jun 29, 2020Updated 5 years ago
- E2E system with LF-MMI; word N-gram for Mandarin☆167Apr 29, 2022Updated 3 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆378Jun 16, 2023Updated 2 years ago
- Production First and Production Ready End-to-End Keyword Spotting Toolkit☆703Sep 17, 2025Updated 6 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An ambient noise detector☆10Aug 23, 2020Updated 5 years ago
- End-to-end Speech Translation☆35Apr 12, 2021Updated 4 years ago
- RepVgg + HiFiGAN☆36Aug 10, 2022Updated 3 years ago
- JOINT EGO-NOISE SUPPRESSION AND KEYWORD SPOTTING ON SWEEPING ROBOTS☆29May 17, 2022Updated 3 years ago
- [NeurIPS' 25] Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.☆204Dec 9, 2025Updated 3 months ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆221Jan 14, 2021Updated 5 years ago
- ☆23Oct 17, 2024Updated last year