Tooling to extract data from scanned paper forms OCR-ed by Tesseract using the HOCR standard.
☆85 · Mar 1, 2016 · Updated 10 years ago
Alternatives and similar repositories for whatwordwhere
Users that are interested in whatwordwhere are comparing it to the libraries listed below.
- A collection of stemmers in Clojure ☆21 · Jan 17, 2023 · Updated 3 years ago
- Tools for TICCL ☆14 · Dec 12, 2025 · Updated 3 months ago
- An easy-to-use point-and-click geocoder 🌍📍 ☆15 · Jan 6, 2023 · Updated 3 years ago
- View HOCR files with Mirador ☆29 · Sep 27, 2017 · Updated 8 years ago
- blocks template ☆18 · Mar 28, 2021 · Updated 5 years ago
- An android app that shows the edges and light sources in the live feed from the phone's camera ☆11 · Sep 11, 2017 · Updated 8 years ago
- Course in Document and Content Analysis. ☆14 · Apr 18, 2020 · Updated 5 years ago
- Notes for my talk "Exploring the Radio Spectrum for News" ☆13 · Mar 6, 2020 · Updated 6 years ago
- ☆11 · Feb 13, 2026 · Updated last month
- Named Entity Recognition tool for Europeana Newspapers ☆14 · Apr 5, 2018 · Updated 8 years ago
- Fork of dump1090-stream-parser. Takes SBS output from `dump1090` and puts it into a database. ☆13 · Apr 16, 2019 · Updated 6 years ago
- R tools for journalists ☆18 · Mar 9, 2018 · Updated 8 years ago
- ☆25 · Mar 18, 2013 · Updated 13 years ago
- Code & supporting data behind Pioneer Press stories and interactives. ☆14 · Jan 16, 2018 · Updated 8 years ago
- A Ruby parser for electronic candidate, PAC and party campaign filings from the Federal Election Commission. ☆15 · Feb 3, 2024 · Updated 2 years ago
- Deutsch Language Tool Kit ☆12 · Aug 31, 2015 · Updated 10 years ago
- Natural language generation with hidden markov models (using hmmlearn) ☆25 · Sep 24, 2016 · Updated 9 years ago
- Stand-alone implementation of UCD's IIIF image re-formatting tool + plugin to integrate with Mirador IIIF-compliant image viewer ☆18 · Jul 31, 2017 · Updated 8 years ago
- OCR-D wrapper for detectron2 based segmentation models ☆17 · May 1, 2025 · Updated 11 months ago
- An Editor for creating simple or complex OCR workflows ☆17 · Jun 13, 2024 · Updated last year
- fork of tesseract for emscripten ☆21 · Jul 21, 2015 · Updated 10 years ago
- GermaNER: Free Open German Named Entity Recognition Tool ☆36 · Dec 16, 2023 · Updated 2 years ago
- Guess a person's gender by their first name. Caveats apply. ☆18 · May 6, 2023 · Updated 2 years ago
- Efficient hOCR tooling ☆56 · Aug 18, 2025 · Updated 7 months ago
- Investigative tool for extracting relevant areas from many documents ☆14 · Nov 17, 2015 · Updated 10 years ago
- An unambiguous dialect of ArchieML ☆23 · Oct 27, 2023 · Updated 2 years ago
- Presentations, tutorials and data for the OCR workshop at LMU ☆16 · Jun 2, 2017 · Updated 8 years ago
- Development version of ndlstm, multidimensional LSTMs for TensorFlow ☆19 · Feb 20, 2018 · Updated 8 years ago
- Ergonomic line-by-line transcription of scanned text. ☆54 · Feb 2, 2026 · Updated 2 months ago
- Next generation OCR engine based on LSTMs. ☆51 · Apr 8, 2018 · Updated 8 years ago
- Some helpful bash profile functions for working with earth imagery ☆33 · Mar 8, 2020 · Updated 6 years ago
- A list of inspirational and thought-provoking reads about women who code. ☆10 · Nov 20, 2025 · Updated 4 months ago
- Multi-dimensional LSTM implementation in TensorFlow ☆22 · Sep 25, 2017 · Updated 8 years ago
- statdivlab's teaching materials for STAMPS @ the MBL in 2019 ☆11 · Jul 28, 2019 · Updated 6 years ago
- nicar 17: advanced pdf manipulation ☆18 · Mar 4, 2017 · Updated 9 years ago
- Tools for managing deployment & operations of Common Search. ☆12 · Aug 26, 2016 · Updated 9 years ago
- Text-Induced Corpus Clean-up ☆20 · Jun 20, 2023 · Updated 2 years ago
- Test using WebWorkers to run D3 geo projection ☆10 · Jul 2, 2018 · Updated 7 years ago
- An extensible viewer for OCR-D mets.xml files ☆23 · May 30, 2024 · Updated last year