A PyTorch implementation of DeepSpeech and DeepSpeech2.
☆50Dec 4, 2018Updated 7 years ago
Alternatives and similar repositories for deepspeech
Users that are interested in deepspeech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simple automatic speech recognition system based on digits corpora (Polish language), created in Kaldi toolkit. Despite of the language d…☆11May 29, 2016Updated 9 years ago
- Adversarial attack against DeepSpeech2 ASR pytorch model☆24Jan 15, 2021Updated 5 years ago
- Neural Network Semantic Parser for Almond☆15Apr 11, 2019Updated 7 years ago
- A step-by-step problem set for implementing a high-quality deep dependency parser in Pytorch☆15Aug 12, 2017Updated 8 years ago
- Deep Learning For Ultrasound Tongue Imaging☆13Dec 17, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆16Apr 4, 2022Updated 4 years ago
- Speech Recognition using DeepSpeech2.☆2,137Dec 13, 2022Updated 3 years ago
- An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.☆67Jan 7, 2026Updated 3 months ago
- Losses and decoders for end-to-end ASR and OCR☆34Oct 30, 2020Updated 5 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"☆83Jul 20, 2022Updated 3 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- Auto Segmentation Criterion (ASG) implemented in pytorch☆51Oct 1, 2021Updated 4 years ago
- Code for TALLIP2019 paper "µ-Forcing: Training Variational Recurrent Autoencoders for Text Generation"☆12May 27, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Deep understanding and modelling of the hierarchical structure of prosody☆24May 12, 2019Updated 6 years ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 3 years ago
- A series of Jupyter notebooks on signal processing☆53Dec 16, 2018Updated 7 years ago
- PyTorch implementations of neural network models for keyword spotting☆11Oct 19, 2020Updated 5 years ago
- Source code of paper "FlowMur: A Stealthy and Practical Audio Backdoor Attack with Limited Knowledge"☆22Jun 12, 2024Updated last year
- Fast and differentiable hidden Markov model in C++☆19Jan 20, 2023Updated 3 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- ☆17Apr 14, 2023Updated 3 years ago
- Neural Language Models as Psycholinguistic Subjects: Representations of Syntactic State☆17Mar 4, 2019Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆17Aug 27, 2025Updated 8 months ago
- Util code, issues, discussions☆29Aug 31, 2018Updated 7 years ago
- PREDICTING TONGUE MOTION IN UNLABELED ULTRASOUND VIDEOS USING CONVOLUTIONAL LSTM NEURAL NETWORKS☆19Oct 29, 2018Updated 7 years ago
- Self-contained Python package for OpenFst☆51Feb 1, 2023Updated 3 years ago
- Speech recognition with CTC in Keras with Tensorflow backend☆31Mar 24, 2023Updated 3 years ago
- Improved Speech Enhancement GANs☆12Jun 24, 2020Updated 5 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…☆24Dec 8, 2019Updated 6 years ago
- ASR with PyTorch☆140Mar 10, 2019Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.☆26Apr 1, 2026Updated 3 weeks ago
- Simple example how to use tensorflow's CTC loss with Voxforge speech data☆18Nov 12, 2016Updated 9 years ago
- MTracker is a tool for automatic splining tongue shapes in ultrasound images by harnessing the power of deep convolutional neural network…☆20Feb 12, 2021Updated 5 years ago
- Evaluation of the classification performance (Speech, Music, and Noise) of 1D (WaveNet) and 2D (MobileNet) CNN and RNN (GRU) on the MUSAN…☆15Sep 23, 2020Updated 5 years ago
- Data processing tools for preparing speech and labels for training TTS voices☆29Aug 13, 2020Updated 5 years ago
- Materials for "Transformers from the Ground Up" at PyData Jeddah on August 5, 2021☆20Aug 5, 2021Updated 4 years ago