Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard.
☆83Mar 1, 2016Updated 10 years ago
Alternatives and similar repositories for whatwordwhere
Users that are interested in whatwordwhere are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OCRopus model for Gothic print (Fraktur)☆19Feb 16, 2020Updated 6 years ago
- NICAR 2016 talk about PDFs!☆63Mar 12, 2016Updated 10 years ago
- Tools for TICCL☆14Dec 12, 2025Updated 6 months ago
- An easy-to-use point-and-click geocoder 🌍📍☆15Jan 6, 2023Updated 3 years ago
- View HOCR files with Mirador☆29Sep 27, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- blocks template☆18Mar 28, 2021Updated 5 years ago
- Notes for my talk "Exploring the Radio Spectrum for News"☆13Mar 6, 2020Updated 6 years ago
- ☆11Feb 13, 2026Updated 4 months ago
- Crop And Splice Segments (of scanned pages)☆14Mar 11, 2019Updated 7 years ago
- Fork of dump1090-stream-parser. Takes SBS output from `dump1090` and puts it into a database.☆13Apr 16, 2019Updated 7 years ago
- Manuals, lexica, OCR test data for PoCoTo and the profiler☆15Jul 2, 2021Updated 4 years ago
- ☆25Mar 18, 2013Updated 13 years ago
- Code & supporting data behind Pioneer Press stories and interactives.☆14Jan 16, 2018Updated 8 years ago
- A Ruby parser for electronic candidate, PAC and party campaign filings from the Federal Election Commission.☆15Feb 3, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Deutsch Language Tool Kit☆12Aug 31, 2015Updated 10 years ago
- Natural language generation with hidden markov models (using hmmlearn)☆25Sep 24, 2016Updated 9 years ago
- Stand-alone implementation of UCD's IIIF image re-formatting tool + plugin to integrate with Mirador IIIF-compliant image viewer☆18Jul 31, 2017Updated 8 years ago
- fork of tesseract for emscripten☆21Jul 21, 2015Updated 10 years ago
- Guess a person's gender by their first name. Caveats apply.☆18May 6, 2023Updated 3 years ago
- GermaNER: Free Open German Named Entity Recognition Tool☆38Dec 16, 2023Updated 2 years ago