artbataev/end2end

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/artbataev/end2end)

artbataev / end2end

Losses and decoders for end-to-end ASR and OCR

☆34

Alternatives and similar repositories for end2end

Users that are interested in end2end are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jfainberg / lattice_combination
View on GitHub
Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices
☆16Mar 19, 2024Updated 2 years ago
vadimkantorov / inferspeech
View on GitHub
PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant
☆10Aug 12, 2019Updated 6 years ago
mpuels / docker-py-kaldi-asr-and-model
View on GitHub
STT Service based on Kaldi ASR
☆15Aug 17, 2018Updated 7 years ago
daanzu / kaldi-fork-active-grammar
View on GitHub
☆10Updated this week
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
ljuvela / GELP
View on GitHub
☆27Apr 21, 2021Updated 5 years ago
awni / transducer
View on GitHub
A Fast Sequence Transducer Implementation with PyTorch Bindings
☆200Sep 20, 2022Updated 3 years ago
kaituoxu / X-Punctuator
View on GitHub
A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text withou…
☆63May 13, 2020Updated 6 years ago
shanguanma / Aligners
View on GitHub
HMM, CTC, RNN-Transducer, forward-backward algorithm
☆20Sep 5, 2023Updated 2 years ago
Sundy1219 / ctc_beam_search_lm
View on GitHub
CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统
☆49Jun 27, 2018Updated 8 years ago
gooofy / kaldi-adapt-lm
View on GitHub
Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model
☆33Jan 26, 2020Updated 6 years ago
ShigekiKarita / espnet-semi-supervised
View on GitHub
ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…
☆38Feb 13, 2020Updated 6 years ago
EgorLakomkin / KTSpeechCrawler
View on GitHub
Automatically constructing corpus for automatic speech recognition from YouTube videos
☆157Feb 15, 2020Updated 6 years ago
idiap / contextual-biasing-on-gpus
View on GitHub
Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…
☆21Sep 25, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
awni / py-arpa-lm
View on GitHub
Python API for reading and querying ARPA formatted language models.
☆33Sep 9, 2014Updated 11 years ago
YiwenShaoStephen / pychain
View on GitHub
PyTorch implementation of LF-MMI for End-to-end ASR
☆221Jan 14, 2021Updated 5 years ago
opendcd / opendcd
View on GitHub
Open Source WFST-based Decoder Toolkit
☆75Feb 11, 2016Updated 10 years ago
alikaratana / SpeakerRecognition
View on GitHub
Text-Dependent Speaker Recognition System with Machine Learning Techniques
☆10Dec 31, 2017Updated 8 years ago
k2-fsa / fast_rnnt
View on GitHub
A torch implementation of a recursion which turns out to be useful for RNN-T.
☆149Aug 25, 2023Updated 2 years ago
BUTSpeechFIT / ASR-hybrid-decoding
View on GitHub
☆17Nov 25, 2019Updated 6 years ago
mdangschat / speech-corpus-dl
View on GitHub
Download and preperation tool for free speech corpora.
☆16Apr 28, 2019Updated 7 years ago
homink / kaldi-asr.forced_decoding
View on GitHub
Perform the forced decoding with target transcription
☆11Sep 12, 2018Updated 7 years ago
skit-ai / kaldi-serve
View on GitHub
Server framework for Kaldi ASR Toolkit
☆99Sep 17, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kaituoxu / kaldi-ktnet1
View on GitHub
Kaldi extended by Kaituo XU with new features in nnet1.
☆12Dec 16, 2018Updated 7 years ago
JarbasAl / kaldi_spotter
View on GitHub
wake word spotting with kaldi
☆19Dec 3, 2020Updated 5 years ago
charlesliucn / LanMIT
View on GitHub
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
☆22Jul 12, 2019Updated 7 years ago
jzlianglu / pykaldi2
View on GitHub
Yet another speech toolkit based on Kaldi and PyTorch
☆173Jul 1, 2020Updated 6 years ago
jpuigcerver / kaldi-decoders
View on GitHub
Custom decoders for Kaldi
☆80Jun 10, 2019Updated 7 years ago
bajibabu / GlottGAN
View on GitHub
This repository contains the files used for our Interspeech 2017 paper.
☆16May 30, 2017Updated 9 years ago
markusdr / transducersaurus
View on GitHub
Automatically exported from code.google.com/p/transducersaurus
☆11Apr 1, 2015Updated 11 years ago
Yangyangii / TPGST-Tacotron
View on GitHub
Google's TPGST reimplementation.
☆34Dec 11, 2019Updated 6 years ago
kate-egorova / ASR-hybrid-decoding
View on GitHub
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Feb 4, 2020Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
k2-fsa / kaldi-decoder
View on GitHub
Decoders from Kaldi using OpenFst
☆35Apr 10, 2026Updated 3 months ago
rosinality / melgan-pytorch
View on GitHub
MelGAN and Tacotron 2 in PyTorch
☆11Oct 22, 2019Updated 6 years ago
r9y9 / icassp2020-espnet-tts-merlin-baseline
View on GitHub
ICASSP 2020 ESPnet-TTS: Merlin baseline system
☆37Oct 28, 2019Updated 6 years ago
candlewill / CNTN
View on GitHub
ChiNese Text Normalization (CNTN) tool for Text-to-speech system
☆37Apr 12, 2018Updated 8 years ago
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
corticph / prefix-beam-search
View on GitHub
Code for prefix beam search tutorial by @labodk
☆186Dec 9, 2020Updated 5 years ago