Sample implementation of OCR metrics (CER, WER) calculation with TesseractOCR and fastwer
☆30 · Jun 25, 2021 · Updated 4 years ago
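For readers unfamiliar with the two metrics named in the description, the sketch below computes CER and WER as edit distance divided by reference length, expressed as percentages. It is a pure-Python illustration and not the repository's code: it does not call TesseractOCR or fastwer, and the function names `edit_distance`, `cer`, and `wer` are hypothetical.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (strings or token lists)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(reference, hypothesis):
    """Character error rate in percent (assumes a non-empty reference)."""
    if not reference:
        raise ValueError("reference must not be empty")
    return 100.0 * edit_distance(reference, hypothesis) / len(reference)


def wer(reference, hypothesis):
    """Word error rate in percent, splitting on whitespace (an assumption)."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return 100.0 * edit_distance(ref_words, hypothesis.split()) / len(ref_words)


if __name__ == "__main__":
    ground_truth = "hello world"
    ocr_output = "hallo world"  # made-up OCR result with one wrong character
    print(f"CER: {cer(ground_truth, ocr_output):.2f}%")
    print(f"WER: {wer(ground_truth, ocr_output):.2f}%")
```

Both rates can exceed 100% when the OCR output contains many insertions, because the denominator is the reference length only.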
Alternatives and similar repositories for OCR-Metrics-CER-WER
Users interested in OCR-Metrics-CER-WER are comparing it to the libraries listed below.
- Check your modified Ground Truth files with visual support! ☆10 · Jan 31, 2024 · Updated 2 years ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces ☆39 · Apr 30, 2025 · Updated 10 months ago
- Core libraries by the PRImA Research Lab ☆16 · Jul 30, 2024 · Updated last year
- Manuals, lexica, OCR test data for PoCoTo and the profiler ☆15 · Jul 2, 2021 · Updated 4 years ago
- A repository for online OCRD training infrastructure. ☆13 · Aug 20, 2020 · Updated 5 years ago
- 'lat' repository, forked from https://github.com/ryanfb/ancientgreekocr-grc. The final training process for lat.traineddata ☆13 · Jan 13, 2016 · Updated 10 years ago
- OCR-D wrapper for detectron2 based segmentation models ☆17 · May 1, 2025 · Updated 10 months ago
- Layout analysis to find layout elements in documents (similar to P2PaLA) ☆20 · Feb 27, 2026 · Updated last week
- Library to parse and create METS files, especially for Archivematica. ☆23 · Feb 3, 2026 · Updated last month
- Python tools for Tesseract OCR training ☆26 · May 2, 2022 · Updated 3 years ago
- Clone of https://gitlab.com/scripta/escriptorium.git with updates from UB Mannheim ☆33 · Feb 12, 2026 · Updated 3 weeks ago
- Named entity annotation tool ☆28 · Jul 6, 2023 · Updated 2 years ago
- Collection of OCR-related python tools and wrappers from @OCR-D ☆133 · Feb 4, 2026 · Updated last month
- python library ☆12 · Nov 25, 2025 · Updated 3 months ago
- OCR-D python tools ☆33 · Aug 16, 2024 · Updated last year
- Input pipelines for large scale, sharded training of deep learning models. ☆40 · Jun 18, 2019 · Updated 6 years ago
- A collection of OCR'd and machine-corrected Greek texts. This base repository contains Git submodules for the different works and an inve… ☆11 · Nov 18, 2014 · Updated 11 years ago
- Project to digitize avant-garde periodicals ☆12 · May 13, 2022 · Updated 3 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML ☆12 · Dec 10, 2025 · Updated 2 months ago
- Java command line tool to convert PAGE XML files with layout and text content to PDF ☆10 · Apr 27, 2020 · Updated 5 years ago
- golang package to provide lightweight internal pub/sub for goroutines ☆29 · Jan 23, 2014 · Updated 12 years ago
- Tools for normalizing the use of some characters and checking file consistencies ☆11 · Jan 12, 2026 · Updated last month
- Implementation of freedesktop.org specifications. ☆16 · Sep 5, 2016 · Updated 9 years ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg ☆12 · Aug 2, 2024 · Updated last year
- Curated list of CLI tools and plugins that help you use AI in Vim, Neovim, and the Terminal. ☆23 · Updated this week
- Data MIDI Lab - A Node.js-based MIDI Controller and web frontend that allows you to generate MIDI notes and control data from arbitrary c… ☆46 · Mar 2, 2013 · Updated 13 years ago
- Simple to use monitoring server application written in Go, extendable with scripts. ☆11 · Aug 4, 2020 · Updated 5 years ago
- A synthetic training data generator for a text recognition CNN ☆10 · Jul 8, 2019 · Updated 6 years ago
- Scripts, data and results for TEI Hackathon ☆12 · Oct 31, 2015 · Updated 10 years ago
- A reliable diacritics database with their associated ASCII characters ☆13 · May 3, 2020 · Updated 5 years ago
- This is a fork of code.google.com/p/snappy-go. ☆11 · Mar 8, 2015 · Updated 11 years ago
- Docker Bind 1.9 image with Webmin Interface ☆11 · Oct 29, 2020 · Updated 5 years ago
- ☆11 · Feb 13, 2026 · Updated 3 weeks ago
- Two-Step Approach to OCR Post-Correction ☆14 · May 24, 2024 · Updated last year
- ☆11 · Nov 14, 2021 · Updated 4 years ago
- Mongoose plugins search site ☆19 · Mar 24, 2023 · Updated 2 years ago
- GloSAT Historical Measurement Table Dataset ☆11 · Dec 3, 2025 · Updated 3 months ago
- STRExp is a framework that provides Explainability (XAI) to Scene Text Recognition (STR) models. ☆11 · Nov 27, 2023 · Updated 2 years ago
- a little nodejs server and script that extracts letters from images via tesseract ☆19 · Mar 4, 2015 · Updated 11 years ago