Scripts and results from our OCR roundup, available on Source
☆150Feb 20, 2019Updated 7 years ago
Alternatives and similar repositories for ocr_testing
Users that are interested in ocr_testing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository collecting all the submodules for the new PyTorch-based OCR System.☆141Feb 22, 2021Updated 5 years ago
- ☆10Mar 10, 2019Updated 7 years ago
- Dense Article Dataset (DAD): A Benchmark Dataset for Document Layout Analysis☆16Jan 13, 2022Updated 4 years ago
- An SQL loader for datasets published via Socrata☆28Dec 8, 2022Updated 3 years ago
- This repository contains all the tools we are working with related to Chequeabot's ecosystem.☆15May 27, 2025Updated 11 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A repository for online OCRD training infrastructure.☆13Aug 20, 2020Updated 5 years ago
- Repository to use/train segmentation models for document layout analysis☆19Jan 13, 2022Updated 4 years ago
- Write data to files split by topic and rolled over on size or a timeout, files can be compressed using lzo, snappy or gzip☆11Jul 12, 2021Updated 4 years ago
- Command line tool to convert page layout files to the latest PAGE XML format. It supports all previous versions of the PAGE format as wel…☆24Jan 30, 2021Updated 5 years ago
- ☆10Mar 16, 2023Updated 3 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 6 years ago
- carebot-tracker.js — Carebot's tracking component for Google Analytics events☆17Apr 19, 2016Updated 10 years ago
- Python-based tools for document analysis and OCR☆3,471May 22, 2021Updated 5 years ago
- College project about article http://www.cs.ust.hk/~quan/publications/yuan-deblur-siggraph07.pdf☆10Jan 25, 2013Updated 13 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This is an OCR solution for receipts, invoices, etc.☆20May 24, 2020Updated 5 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆411Aug 10, 2024Updated last year
- a little nodejs server and script that extracts letters from images via tesseract☆19Mar 4, 2015Updated 11 years ago
- LINKED DATA QUALITY REPORTS☆41May 20, 2022Updated 4 years ago
- ☆25Apr 18, 2020Updated 6 years ago
- A module for accessing a XLSX spreadsheet as a JavaScript object.☆16Aug 25, 2019Updated 6 years ago
- ☆72Jun 13, 2018Updated 7 years ago
- ☆15Jun 22, 2020Updated 5 years ago
- OCR ACE tensorflow☆11Jul 5, 2019Updated 6 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆126Apr 18, 2020Updated 6 years ago
- BitCurator NLP: Portal repository for the BitCurator NLP tools☆16Apr 12, 2022Updated 4 years ago
- Page to PAGE Layout Analysis Tool☆191Jan 17, 2022Updated 4 years ago
- Course Materials for DPI-691M - "Programming and Data for Policymakers"☆17Jan 17, 2026Updated 4 months ago
- The open-source engine that powers bigbuilder, the Los Angeles Times Data Desk's system for publishing standalone pages☆24Mar 30, 2020Updated 6 years ago
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation.☆12Nov 17, 2023Updated 2 years ago
- Rotation and skew detection using DL.☆60May 29, 2018Updated 7 years ago
- A build tool by and for the Los Angeles Times☆30Oct 15, 2025Updated 7 months ago
- React/Redux Chartwerk editor.☆10Oct 5, 2018Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Implementation of BertGrid : https://arxiv.org/abs/1909.04948☆30Apr 10, 2024Updated 2 years ago
- ☆18Sep 25, 2021Updated 4 years ago
- Project generator for use with the datakit framework.☆30Apr 13, 2026Updated last month
- Simple app using JNI to read files inside the app's Assets folder☆11Apr 8, 2016Updated 10 years ago
- Issuu scraper written in Python.☆17Jul 22, 2019Updated 6 years ago
- A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cl…☆1,085Oct 20, 2023Updated 2 years ago
- Clojure library exposing newline delimited files as lightning fast databases☆14Feb 27, 2025Updated last year