distillpub/post--ctc

Sequence Modelling with CTC

☆52

Alternatives and similar repositories for post--ctc

Users who are interested in post--ctc are comparing it to the libraries listed below.

1ytic / warp-rna
Recurrent Neural Aligner
☆51 · Apr 14, 2020 · Updated 6 years ago
noajshu / scotus-speech
Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court
☆22 · Dec 8, 2022 · Updated 3 years ago
dayanavivolab / s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆10 · Feb 29, 2024 · Updated 2 years ago
DaoZhang0123 / compareCTCDecoder
Compares three CTC decoders: greedy decoder, beam decoder, and prefix beam decoder
☆20 · Jul 10, 2018 · Updated 8 years ago
CPBridge / RIFeatures
A small C++ library for efficient calculation of rotation invariant features in 2D images using OpenCV.
☆12 · Feb 12, 2021 · Updated 5 years ago
FlorinAndrei / misc
A catch-all repo
☆11 · Dec 28, 2023 · Updated 2 years ago
MontrealCorpusTools / kalpy
Pybind11 bindings for Kaldi
☆15 · Jul 11, 2026 · Updated 2 weeks ago
JazminVidal / gop-ft
Transfer learning approach to pronunciation scoring
☆12 · Jan 17, 2024 · Updated 2 years ago
CUNY-CL / wikipron-modeling
Proposed splits for the LREC Wikipron paper
☆15 · Apr 7, 2020 · Updated 6 years ago
zelaki / DisfluentFA
A Weakly Supervised Forced Alignment for disfluent speech
☆15 · Nov 12, 2023 · Updated 2 years ago
NickRuiz / power-asr
Phonetically-Oriented Word Error Rate
☆36 · May 4, 2019 · Updated 7 years ago
YiwenShaoStephen / pychain
PyTorch implementation of LF-MMI for End-to-end ASR
☆221 · Jan 14, 2021 · Updated 5 years ago
EricWilbanks / faseAlign
Command line tool for forced-alignment of Spanish speech data
☆13 · Dec 31, 2025 · Updated 6 months ago
easonnie / ResEncoder
Residual-connected sentence encoder for NLI.
☆11 · Jan 21, 2018 · Updated 8 years ago
MasonPhonLab / MAPS
Mason-Alberta Phonetic Segmenter
☆15 · Feb 24, 2026 · Updated 5 months ago
abdfahim / audioprocessing
Standard libraries for audio processing, especially STFT and Spherical Harmonics decomposition of a soundfield.
☆10 · Nov 29, 2021 · Updated 4 years ago
nessessence / Kaldi_ASR_Tutorial
Speech recognition using the Kaldi framework
☆12 · Dec 25, 2019 · Updated 6 years ago
b-flo / warp-transducer
A fast parallel implementation of RNN Transducer.
☆12 · Apr 8, 2025 · Updated last year
lex4all / lex4all
pronunciation LEXicons for Any Low-resource Language
☆21 · Jul 14, 2020 · Updated 6 years ago
cetmann / robustness-interpretability
Code for the Paper 'On the Connection Between Adversarial Robustness and Saliency Map Interpretability' by C. Etmann, S. Lunz, P. Maass, …
☆16 · May 9, 2019 · Updated 7 years ago
MokkeMeguru / TFGENZOO
Helper library for constructing generative models, e.g. flow-based models, with TensorFlow 2.x.
☆12 · Feb 16, 2023 · Updated 3 years ago
Heisenberg0391 / TextImageGenerator
This script generates image files from a corpus file, suitable for CV tasks such as text recognition
☆29 · Aug 4, 2021 · Updated 4 years ago
0417keito / PromptTTS2
[WIP] Unofficial Implementation of Microsoft's PromptTTS2
☆53 · Oct 31, 2023 · Updated 2 years ago
jondot / deep-learning-parameters-cheatsheet
☆13 · Dec 4, 2017 · Updated 8 years ago
vocaliodmiku / wav2vec2mdd-Text
☆19 · Jun 28, 2022 · Updated 4 years ago
randommm / bevy-shoot-em-up
A simple shoot 'em up style game using Rust's Bevy crate https://play.marcoinacio.com
☆15 · Feb 25, 2024 · Updated 2 years ago
microspaze / FlutterBridge.Maui
Flutter Bridge for .NET Maui
☆13 · Jul 12, 2024 · Updated 2 years ago
kiwi0fruit / enaml-video-app
Easy-to-install cross-platform Python desktop app that gets video via OpenCV and displays it via LGPL Qt 5 for Python (PySide2) GUI compo…
☆10 · Jul 18, 2019 · Updated 7 years ago
Kahsolt / TransTacoS-RetuneGAN
A toy-like text-to-speech system for Chinese/Mandarin synthesis, inspired by Tacotron & FastSpeech2 & RefineGAN.
☆15 · May 25, 2022 · Updated 4 years ago
Yuanbo2020 / Audio-Visual-VAD
☆13 · May 9, 2022 · Updated 4 years ago
KoelLabs / ML
Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learner…
☆24 · Jul 13, 2026 · Updated last week
numenta / hypersearch
A particle swarm optimization library created by Numenta for hyperparameter optimization.
☆18 · Aug 18, 2015 · Updated 10 years ago
yhbcode000 / SinGlow
SinGlow is a part of my singing voice synthesis system. It can extract features of sound, particularly songs and music. Then we can use …
☆11 · Oct 9, 2021 · Updated 4 years ago
bqplot / bqplot-gallery
Gallery of applications built using bqplot and widget libraries like ipywidgets, ipydatagrid etc.
☆11 · Feb 1, 2023 · Updated 3 years ago
vittorione94 / ICP-Implementation
Iterative Closest Point algorithm for scans/mesh alignment (with subsampling and point to plane improvements).
☆10 · Jul 15, 2018 · Updated 8 years ago
yao-matrix / deepSpeech2
End-to-end speech recognition using TensorFlow
☆48 · Apr 2, 2018 · Updated 8 years ago
githubharald / CTCDecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing…
☆837 · Jan 31, 2026 · Updated 5 months ago
Dedsec-Xu / DatasetImgLabel-ICDAR2015
DatasetImgLabeler is an image annotation tool for researchers to prepare datasets in ICDAR2015 format
☆12 · Dec 7, 2019 · Updated 6 years ago
NeuroLIAA / visions
Visual Search in Natural Scenes benchmark
☆20 · Sep 19, 2024 · Updated last year