A PyTorch implementation of DeepSpeech and DeepSpeech2.
☆50Dec 4, 2018Updated 7 years ago
Alternatives and similar repositories for deepspeech
Users that are interested in deepspeech are comparing it to the libraries listed below
Sorting:
- ☆16Apr 4, 2022Updated 3 years ago
- Simple automatic speech recognition system based on digits corpora (Polish language), created in Kaldi toolkit. Despite of the language d…☆11May 29, 2016Updated 9 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- Auto Segmentation Criterion (ASG) implemented in pytorch☆51Oct 1, 2021Updated 4 years ago
- Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"☆83Jul 20, 2022Updated 3 years ago
- ☆17Aug 27, 2025Updated 6 months ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.☆67Jan 7, 2026Updated last month
- ☆10Apr 8, 2024Updated last year
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- ASR with PyTorch☆140Mar 10, 2019Updated 6 years ago
- speech-to-text in pytorch☆82Mar 14, 2019Updated 6 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- ☆12Sep 1, 2021Updated 4 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.☆25Dec 11, 2025Updated 2 months ago
- A phoneme-allophone database for many languages☆53May 19, 2020Updated 5 years ago
- Extract phoneme-level timestamps from speeh audio.☆117Updated this week
- RNN model to punctuate degraded text with no punctuation, and an application that combines it with Watson TTS for automated transcription…☆10Apr 9, 2017Updated 8 years ago
- PyTorch implementations of neural network models for keyword spotting☆11Oct 19, 2020Updated 5 years ago
- PyTorch end-to-end speech recognition☆49Dec 30, 2020Updated 5 years ago
- Deep Learning For Ultrasound Tongue Imaging☆12Dec 17, 2024Updated last year
- ☆32Jul 27, 2022Updated 3 years ago
- GPT for FACodec☆13Mar 25, 2024Updated last year
- Self-contained Python package for OpenFst☆51Feb 1, 2023Updated 3 years ago
- Util code, issues, discussions☆29Aug 31, 2018Updated 7 years ago
- Speech Recognition using DeepSpeech2.☆2,139Dec 13, 2022Updated 3 years ago
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Aug 6, 2020Updated 5 years ago
- Script for converting kaldi GMM/HMM models to HTK format☆11Jul 18, 2024Updated last year
- Fast and differentiable hidden Markov model in C++☆19Jan 20, 2023Updated 3 years ago
- Speech to text library for Rhasspy using Kaldi☆15Dec 9, 2023Updated 2 years ago
- All resources created and used in Arabic Sentiment Analysis of Arabic Tweets. Includes Sentiment lexicon generated from Arabic tweets and…☆14Dec 21, 2021Updated 4 years ago
- Code for TALLIP2019 paper "µ-Forcing: Training Variational Recurrent Autoencoders for Text Generation"☆12May 27, 2019Updated 6 years ago
- Neural Network Semantic Parser for Almond☆15Apr 11, 2019Updated 6 years ago
- Tesseract4 finetuned traineddata for Central Kurdish/Sorani☆11Apr 18, 2020Updated 5 years ago