Sequence Modelling with CTC
☆52Dec 29, 2022Updated 3 years ago
Alternatives and similar repositories for post--ctc
Users that are interested in post--ctc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Recurrent Neural Aligner☆51Apr 14, 2020Updated 6 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Dec 8, 2022Updated 3 years ago
- compare three CTC decoder, that is greedy decoder, beam decoder and prefix beam decoder☆20Jul 10, 2018Updated 7 years ago
- A small C++ library for efficient calculation of rotation invariant features in 2D images using OpenCV.☆12Feb 12, 2021Updated 5 years ago
- a catch-all repo☆11Dec 28, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- unofficial implementation of YOLOP TensorRT☆12Dec 11, 2021Updated 4 years ago
- Use the MobileNet V2 as the basenet instead of the original VGG16☆14Aug 28, 2019Updated 6 years ago
- Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learner…☆24Jun 27, 2026Updated last week
- Pybind11 bindings for Kaldi☆15Jun 22, 2026Updated last week
- Transfer learning approach to pronunciation scoring☆12Jan 17, 2024Updated 2 years ago
- Proposed splits for the LREC Wikipron paper☆15Apr 7, 2020Updated 6 years ago
- Python/numpy/pandas convenience wrapper for the TIMIT database.☆11Nov 26, 2018Updated 7 years ago
- A Weakly Supervised Forced Alignment for disluent speech☆15Nov 12, 2023Updated 2 years ago
- Python script to download all Creative Commons licensed videos from a Youtube channel☆14Sep 25, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- R Code recipes for Functional Data Analysis for phonetic analysis.☆13Jul 31, 2024Updated last year
- ☆37Jun 9, 2026Updated 3 weeks ago
- Neural network-based forced alignment with bidirectional attention mechanism☆78Jan 17, 2025Updated last year
- Datasets de los textos de cuentos de varios autorxs latinoamericanxs. Datasets benchmarks de distintas librerías de sentiment analysis en…☆18Sep 8, 2024Updated last year
- A fast parallel implementation of RNN Transducer.☆12Apr 8, 2025Updated last year
- Scripts for training Kaldi for German speech recognition (ASR).☆27Feb 11, 2021Updated 5 years ago
- This repo is for residual-connected sentence encoder for NLI.☆11Jan 21, 2018Updated 8 years ago
- Converts Quora's new NLU dataset to SNLI txt/jsonl format, plus test/dev split, tokenization.☆14Jan 27, 2017Updated 9 years ago
- [ICASSP 2025] PDSeg: Patch-Wise Distillation and Controllable Image Generation for Weakly-Supervised Histopathology Tissue Segmentation☆18May 23, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Mason-Alberta Phonetic Segmenter☆15Feb 24, 2026Updated 4 months ago
- [ICIP 2024] Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model☆16May 23, 2025Updated last year
- speech recognition using Kaldi framework☆12Dec 25, 2019Updated 6 years ago
- Port of GGML to C#☆13Jul 1, 2023Updated 3 years ago
- 该脚本根据语料文件生成对应的图像文件,适用于文本识别等CV任务☆29Aug 4, 2021Updated 4 years ago
- pronunciation LEXicons for Any Low-resource Language☆21Jul 14, 2020Updated 5 years ago
- Flutter Bridge for .NET Maui☆13Jul 12, 2024Updated last year
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆53Oct 31, 2023Updated 2 years ago
- ☆19Jun 28, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆13Dec 4, 2017Updated 8 years ago
- A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15May 25, 2022Updated 4 years ago
- SinGlow is a part of my Singing voice synthesis system. It can extract features of sound, particularly songs and musics. Then we can use …☆11Oct 9, 2021Updated 4 years ago
- ☆13May 9, 2022Updated 4 years ago
- Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing…☆836Jan 31, 2026Updated 5 months ago
- End-to-end speech recognition using TensorFlow☆48Apr 2, 2018Updated 8 years ago
- DatasetImgLabeler is a image annotation tool for researchers to prepare datasets in ICDAR2015 format☆12Dec 7, 2019Updated 6 years ago