Sequence Modelling with CTC
☆52Dec 29, 2022Updated 3 years ago
Alternatives and similar repositories for post--ctc
Users that are interested in post--ctc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Recurrent Neural Aligner☆51Apr 14, 2020Updated 6 years ago
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Dec 8, 2022Updated 3 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆10Feb 29, 2024Updated 2 years ago
- compare three CTC decoder, that is greedy decoder, beam decoder and prefix beam decoder☆20Jul 10, 2018Updated 7 years ago
- A small C++ library for efficient calculation of rotation invariant features in 2D images using OpenCV.☆12Feb 12, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- a catch-all repo☆11Dec 28, 2023Updated 2 years ago
- Use the MobileNet V2 as the basenet instead of the original VGG16☆14Aug 28, 2019Updated 6 years ago
- Pybind11 bindings for Kaldi☆15Feb 1, 2026Updated 3 months ago
- Proposed splits for the LREC Wikipron paper☆15Apr 7, 2020Updated 6 years ago
- Python/numpy/pandas convenience wrapper for the TIMIT database.☆11Nov 26, 2018Updated 7 years ago
- R Code recipes for Functional Data Analysis for phonetic analysis.☆13Jul 31, 2024Updated last year
- Phonetically-Oriented Word Error Rate☆36May 4, 2019Updated 7 years ago
- ☆37Apr 16, 2026Updated 2 weeks ago
- Khoá học Python for Data Analysis dành cho các bạn mới bắt đầu☆26Feb 25, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Neural network-based forced alignment with bidirectional attention mechanism☆78Jan 17, 2025Updated last year
- Mason-Alberta Phonetic Segmenter☆15Feb 24, 2026Updated 2 months ago
- [ICIP 2024] Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model☆16May 23, 2025Updated 11 months ago
- speech recognition using Kaldi framework☆12Dec 25, 2019Updated 6 years ago
- Port of GGML to C#☆13Jul 1, 2023Updated 2 years ago
- 该脚本根据语料文件生成对应的图像文件,适用于文本识别等CV任务☆29Aug 4, 2021Updated 4 years ago
- pronunciation LEXicons for Any Low-resource Language☆21Jul 14, 2020Updated 5 years ago
- Open image viewer is a hardware accelerated open code c++20 compliant cross platform 'C' library and application for viewing and manipula…☆35Updated this week
- Flutter Bridge for .NET Maui☆13Jul 12, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆55Oct 31, 2023Updated 2 years ago
- ☆13Dec 4, 2017Updated 8 years ago
- A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15May 25, 2022Updated 3 years ago
- ☆18Sep 15, 2020Updated 5 years ago
- Download and preperation tool for free speech corpora.☆16Apr 28, 2019Updated 7 years ago
- SinGlow is a part of my Singing voice synthesis system. It can extract features of sound, particularly songs and musics. Then we can use …☆11Oct 9, 2021Updated 4 years ago
- ☆13May 9, 2022Updated 3 years ago
- Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing…☆835Jan 31, 2026Updated 3 months ago
- End-to-end speech recognition using TensorFlow☆48Apr 2, 2018Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Improving Tiny Vehicle Detection in Complex Scenes, ICME, 2018☆12Jul 7, 2018Updated 7 years ago
- DatasetImgLabeler is a image annotation tool for researchers to prepare datasets in ICDAR2015 format☆12Dec 7, 2019Updated 6 years ago
- A Python implementation of Partial Least Squares (PLS) decomposition☆18Jul 14, 2021Updated 4 years ago
- ☆25Jun 14, 2022Updated 3 years ago
- Incremental Learning with Adaptive Resonance Theory (ART) & Developmental Resonance networks☆13Dec 18, 2019Updated 6 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25May 6, 2019Updated 6 years ago
- tensorflow c++ example for VS2015☆31Jul 31, 2018Updated 7 years ago