NVIDIA/OpenSeq2Seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

☆1,559

Alternatives and similar repositories for OpenSeq2Seq

Users who are interested in OpenSeq2Seq are comparing it to the libraries listed below.

flashlight / wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
☆6,440 · Updated this week
mravanelli / pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…
☆2,398 · Mar 14, 2022 · Updated 4 years ago
tensorflow / lingvo
Lingvo
☆2,860 · Jun 22, 2026 · Updated 3 weeks ago
NVIDIA / waveglow
A Flow-based Generative Network for Speech Synthesis
☆2,340 · Oct 19, 2023 · Updated 2 years ago
freewym / espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
☆939 · Sep 4, 2024 · Updated last year
espnet / espnet
End-to-End Speech Processing Toolkit
☆9,895 · Updated this week
syhw / wer_are_we
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
☆1,864 · Jun 27, 2022 · Updated 4 years ago
ksw0306 / ClariNet
A Pytorch Implementation of ClariNet
☆293 · Aug 5, 2019 · Updated 6 years ago
SeanNaren / deepspeech.pytorch
Speech Recognition using DeepSpeech2.
☆2,136 · Dec 13, 2022 · Updated 3 years ago
Rayhane-mamah / Tacotron-2
DeepMind's Tacotron-2 Tensorflow implementation
☆2,323 · Jul 6, 2023 · Updated 3 years ago
HawkAaron / warp-transducer
A fast parallel implementation of RNN Transducer.
☆314 · Jun 7, 2023 · Updated 3 years ago
NVIDIA / nv-wavenet
Reference implementation of real-time autoregressive wavenet inference
☆745 · Jan 19, 2021 · Updated 5 years ago
hirofumi0810 / neural_sp
End-to-end ASR/LM implementation with PyTorch
☆594 · Aug 30, 2021 · Updated 4 years ago
ksw0306 / FloWaveNet
A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"
☆490 · Apr 23, 2019 · Updated 7 years ago
NVIDIA / tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
☆5,301 · Jun 12, 2024 · Updated 2 years ago
mkotha / WaveRNN
A WaveRNN implementation
☆201 · Oct 14, 2019 · Updated 6 years ago
cywang97 / StreamingTransformer
☆277 · Jan 15, 2021 · Updated 5 years ago
TensorSpeech / TensorFlowASR
TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw…
☆1,009 · Jun 11, 2025 · Updated last year
syang1993 / gst-tacotron
A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
☆367 · Dec 6, 2018 · Updated 7 years ago
srvk / eesen
The official repository of the Eesen project
☆834 · May 23, 2019 · Updated 7 years ago
npuichigo / waveglow
A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis
☆205 · Nov 6, 2018 · Updated 7 years ago
kaldi-asr / kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
☆15,431 · Sep 22, 2025 · Updated 9 months ago
NVIDIA / mellotron
Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t…
☆870 · Jul 22, 2023 · Updated 2 years ago
facebookresearch / fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆32,244 · Sep 30, 2025 · Updated 9 months ago
zzw922cn / awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synth…
☆3,125 · Oct 19, 2023 · Updated 2 years ago
bshall / UniversalVocoding
A PyTorch implementation of "Robust Universal Neural Vocoding"
☆238 · Nov 14, 2020 · Updated 5 years ago
jaywalnut310 / waveglow-vqvae
WaveGlow vocoder with VQVAE
☆61 · Jun 18, 2019 · Updated 7 years ago
soobinseo / Transformer-TTS
A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"
☆690 · Nov 8, 2023 · Updated 2 years ago
nii-yamagishilab / TSNetVocoder
☆42 · Oct 30, 2018 · Updated 7 years ago
awni / speech
A PyTorch Implementation of End-to-End Models for Speech-to-Text
☆768 · Jul 6, 2023 · Updated 3 years ago
pykaldi / pykaldi
A Python wrapper for Kaldi
☆1,038 · Nov 30, 2025 · Updated 7 months ago
kaituoxu / Speech-Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆810 · Apr 6, 2023 · Updated 3 years ago
NVIDIA / Milano
Milano is a tool for automating hyper-parameters search for your models on a backend of your choice.
☆154 · Nov 7, 2018 · Updated 7 years ago
Kyubyong / expressive_tacotron
Tensorflow Implementation of Expressive Tacotron
☆194 · Nov 3, 2018 · Updated 7 years ago
DemisEom / SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
☆655 · Apr 5, 2022 · Updated 4 years ago
NVIDIA-NeMo / Speech
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto…
☆17,789 · Updated this week
hrbigelow / ae-wavenet
Wavenet Autoencoder for Unsupervised speech representation learning (after Chorowski, Jan 2019)
☆176 · Sep 16, 2020 · Updated 5 years ago
YiwenShaoStephen / pychain
PyTorch implementation of LF-MMI for End-to-end ASR
☆221 · Jan 14, 2021 · Updated 5 years ago
facebookresearch / CPC_audio
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
☆372 · Oct 12, 2021 · Updated 4 years ago