Sequence Modelling with CTC
☆52Dec 29, 2022Updated 3 years ago
Alternatives and similar repositories for post--ctc
Users that are interested in post--ctc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Dec 8, 2022Updated 3 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆10Feb 29, 2024Updated 2 years ago
- compare three CTC decoder, that is greedy decoder, beam decoder and prefix beam decoder☆20Jul 10, 2018Updated 7 years ago
- A small C++ library for efficient calculation of rotation invariant features in 2D images using OpenCV.☆12Feb 12, 2021Updated 5 years ago
- a catch-all repo☆11Dec 28, 2023Updated 2 years ago
- unofficial implementation of YOLOP TensorRT☆13Dec 11, 2021Updated 4 years ago
- Use the MobileNet V2 as the basenet instead of the original VGG16☆14Aug 28, 2019Updated 6 years ago
- Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learner…☆18Mar 14, 2026Updated last week
- Pybind11 bindings for Kaldi☆15Feb 1, 2026Updated last month
- Proposed splits for the LREC Wikipron paper☆15Apr 7, 2020Updated 5 years ago
- Python/numpy/pandas convenience wrapper for the TIMIT database.☆11Nov 26, 2018Updated 7 years ago
- A Weakly Supervised Forced Alignment for disluent speech☆15Nov 12, 2023Updated 2 years ago
- R Code recipes for Functional Data Analysis for phonetic analysis.☆13Jul 31, 2024Updated last year
- Phonetically-Oriented Word Error Rate☆36May 4, 2019Updated 6 years ago
- ☆37Updated this week
- PyTorch implementation of LF-MMI for End-to-end ASR☆221Jan 14, 2021Updated 5 years ago
- Hypercorn is an ASGI and WSGI Server based on Hyper libraries and inspired by Gunicorn.☆14Jan 12, 2026Updated 2 months ago
- Neural network-based forced alignment with bidirectional attention mechanism☆78Jan 17, 2025Updated last year
- Scripts for training Kaldi for German speech recognition (ASR).☆27Feb 11, 2021Updated 5 years ago
- This repo is for residual-connected sentence encoder for NLI.☆11Jan 21, 2018Updated 8 years ago
- Converts Quora's new NLU dataset to SNLI txt/jsonl format, plus test/dev split, tokenization.☆14Jan 27, 2017Updated 9 years ago
- [ICASSP 2025] PDSeg: Patch-Wise Distillation and Controllable Image Generation for Weakly-Supervised Histopathology Tissue Segmentation☆18May 23, 2025Updated 10 months ago
- 5GTANGO Smart Manufacturing Pilot☆13May 1, 2023Updated 2 years ago
- [ICIP 2024] Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model☆16May 23, 2025Updated 10 months ago
- speech recognition using Kaldi framework☆12Dec 25, 2019Updated 6 years ago
- 该脚本根据语料文件生成对应的图像文件,适用于文本识别等CV任务☆29Aug 4, 2021Updated 4 years ago
- pronunciation LEXicons for Any Low-resource Language☆21Jul 14, 2020Updated 5 years ago
- ☆13Dec 4, 2017Updated 8 years ago
- A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15May 25, 2022Updated 3 years ago
- Easy to install cross-platform python desktop app that gets video via OpenCV and displays it via LGPL Qt 5 for Python (PySide2) GUI compo…☆10Jul 18, 2019Updated 6 years ago
- A particle swarm optimization library created by Numenta for hyperparameter optimization.☆18Aug 18, 2015Updated 10 years ago
- SinGlow is a part of my Singing voice synthesis system. It can extract features of sound, particularly songs and musics. Then we can use …☆11Oct 9, 2021Updated 4 years ago
- ☆13May 9, 2022Updated 3 years ago
- Download and preperation tool for free speech corpora.☆16Apr 28, 2019Updated 6 years ago
- ☆14Jun 17, 2024Updated last year
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech☆27May 25, 2023Updated 2 years ago
- Functional-Level Virtualization in 5G Core Network☆13Jun 9, 2018Updated 7 years ago
- Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing…☆835Jan 31, 2026Updated last month
- Pretiffy the WBS UI☆11Jul 8, 2025Updated 8 months ago