Yet Another Peg WikiText Parser
☆34Oct 31, 2012Updated 13 years ago
Alternatives and similar repositories for kiwi
Users that are interested in kiwi are comparing it to the libraries listed below
- ☆12Jan 7, 2023Updated 3 years ago
- NewsEye / READ OCR training dataset from Austrian Newspapers (1864–1911)☆18Oct 31, 2025Updated 4 months ago
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- Python (Cython) binding for HarfBuzz, an OpenType text shaping engine.☆19Aug 24, 2018Updated 7 years ago
- Augment line images for improving OCR datasets☆10Oct 4, 2023Updated 2 years ago
- ☆13Jan 25, 2019Updated 7 years ago
- Tesseract tessdata downloader from GitHub repositories☆11Sep 17, 2021Updated 4 years ago
- Stanford University cs224N course. Deep Learning with NLP. Solutions in Python3.6☆10Jan 1, 2019Updated 7 years ago
- resources, links for OCR & greek☆10Mar 8, 2021Updated 4 years ago
- Download episodes from the Mako VOD service☆10Oct 3, 2018Updated 7 years ago
- ☆10Mar 16, 2023Updated 2 years ago
- A python package for Tensorflow. It has the right level of abstraction needed to build state-of-the-art deep learning models. Tensormodel…☆13Jul 5, 2016Updated 9 years ago
- The trivia game freshly generated from Wikipedia articles.☆31Nov 24, 2009Updated 16 years ago
- Rust wrapper for the cld2 language detection library.☆16Nov 28, 2017Updated 8 years ago
- This repository is for the Computer Vision Nano-degree Program from Udacity.☆11Aug 30, 2024Updated last year
- A command line tool to render HTML and text emails of markdown content.☆36Feb 4, 2026Updated last month
- Docker container for ocropus3 OCR system☆12Aug 19, 2018Updated 7 years ago
- Converts YouTube XML Annotations to ASS subtitles☆17Dec 17, 2018Updated 7 years ago
- Python SIP wrapper for libtesseract (Apache license)☆12Feb 20, 2017Updated 9 years ago
- Zig bindings for the `gtk4` library☆13Jan 20, 2023Updated 3 years ago
- This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation…☆14Dec 6, 2025Updated 2 months ago
- image-segmentation and text-localization☆12Aug 22, 2018Updated 7 years ago
- Entity extraction from PDFs with Tesseract and Machine Learning☆10Mar 19, 2021Updated 4 years ago
- zlib with the build system replaced by zig☆15Apr 17, 2024Updated last year
- ☆13Apr 19, 2022Updated 3 years ago
- ☆13May 7, 2022Updated 3 years ago
- Git "recursive rebase".☆16May 15, 2020Updated 5 years ago
- Buffruneio provides rune-based buffered input☆30Nov 12, 2020Updated 5 years ago
- TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain…☆15Feb 12, 2024Updated 2 years ago
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Oct 24, 2016Updated 9 years ago
- OCR-D post-correction with encoder-attention-decoder LSTMs☆13May 1, 2025Updated 10 months ago
- Find headers that are slow to compile☆12Mar 2, 2018Updated 8 years ago
- The Taamey D font for Biblical Hebrew☆17Jan 23, 2024Updated 2 years ago
- JBuild is an intentionally simple, small build tool for Java.☆22Feb 12, 2026Updated 3 weeks ago
- C11 parser with GNU C extensions written in C++14☆18Sep 1, 2018Updated 7 years ago
- golang AST matcher☆20Apr 19, 2022Updated 3 years ago
- The Arabic letters began from my Latin design called Changa, published in the Google Fonts catalog☆16Jun 7, 2018Updated 7 years ago
- A tensorflow implementation of the paper "Searching for MobileNetV3" with the R-ASPP segmentation head☆13Mar 24, 2023Updated 2 years ago
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Jul 2, 2021Updated 4 years ago