ocropus / ocropus4-eval
Tools for evaluating OCR performance relative to ground truth.
☆10Updated last year
Alternatives and similar repositories for ocropus4-eval:
Users that are interested in ocropus4-eval are comparing it to the libraries listed below
- DFKI Layout Detection for OCR-D☆47Updated 2 months ago
- Ergonomic line-by-line transcription of scanned text.☆50Updated 4 years ago
- This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an…☆22Updated 4 years ago
- OCR-D post-correction module based on weighted finite-state transducers☆11Updated last year
- Python-based research framework for developing, organizing, and deploying Deep Learning models powered by Tensorflow.☆12Updated 2 years ago
- Storyfinder - A Browser Plugin and Server Backend for Personalized Knowledge- and Information Management☆15Updated 9 months ago
- A ruby gem to extract structured data from Google Local Search Results using the serpapi/bert-base-local-results model, enabling parsing,…☆15Updated last year
- ☆12Updated 8 months ago
- Home to jupyter notebooks for Mindee OSS projects☆15Updated 3 months ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆63Updated this week
- Discourse Analysis Tool Suite☆18Updated this week
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents☆22Updated 2 years ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 3 years ago
- Open Access PDF harvester☆35Updated 8 months ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆46Updated 9 months ago
- A visual tool to interpret and understand PyTorch machine learning models☆15Updated 11 months ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated 4 months ago
- Run embedding models using ONNX☆28Updated 11 months ago
- Scrollership through 20m pubmed abstracts.☆26Updated last year
- ☆67Updated 10 months ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Updated 3 years ago
- Layout Analysis Dataset with Segmonto (LADaS)☆19Updated last month
- A browser extension providing Open Access bibliographical services☆14Updated 2 years ago
- Small python package to measure OCR quality and other related metrics.☆21Updated 10 months ago
- Logical structure analysis for visually structured documents☆85Updated 2 years ago
- examples and guides to using Nomic Atlas☆27Updated 4 months ago
- Recognize text using Calamari OCR and the OCR-D framework☆14Updated 2 months ago
- Conversions between various OCR formats☆73Updated last year
- Corpus Build OCR platform☆8Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Updated 5 months ago