OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
☆262 | Jan 19, 2016 | Updated 10 years ago
Alternatives and similar repositories for OCRmyPDF
Users who are interested in OCRmyPDF are comparing it to the repositories listed below
- Administrator interface and tools for managing CKAN Data Catalogs. ☆23 | Nov 5, 2015 | Updated 10 years ago
- A simple script to look for and process all the federal data.json data inventories. ☆46 | Mar 10, 2015 | Updated 11 years ago
- This is a list of various datasets that are collected by States initially and then provided to federal agencies. ☆20 | Dec 17, 2021 | Updated 4 years ago
- Training files produced for and by the Tesseract OCR engine for work on the Early Modern OCR Project (eMOP) ☆37 | Sep 24, 2015 | Updated 10 years ago
- Open Data Portal Requirements ☆14 | May 13, 2025 | Updated 10 months ago
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED] ☆25 | Sep 14, 2016 | Updated 9 years ago
- A pipeline for automated mapping of aggregate racial/ancestral groups - based on a 1976 map of Chicago ☆21 | Oct 17, 2017 | Updated 8 years ago
- Lib flatterer: A lib to make JSON flatterer ☆17 | May 16, 2025 | Updated 10 months ago
- The OpenGov Foundation's tax filing, organizational and legal documents ☆14 | Apr 16, 2018 | Updated 7 years ago
- Friendly Slack bot for looking up cases ☆21 | Dec 19, 2017 | Updated 8 years ago
- Data on 268 New York City traffic deaths in 2014. ☆10 | Feb 19, 2015 | Updated 11 years ago
- Resolve data table conflicts ☆17 | Jun 11, 2015 | Updated 10 years ago
- Tools for working with online critical apparatus in TEI ☆11 | Sep 5, 2023 | Updated 2 years ago
- code to analyze the legal citation network ☆25 | Sep 16, 2017 | Updated 8 years ago
- All of The OpenGov Foundation's legal docs in one externals-linked repo. ☆23 | Oct 30, 2015 | Updated 10 years ago
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents ☆15 | Oct 26, 2017 | Updated 8 years ago
- Publishes the Service Manual on GOV.UK ☆12 | Mar 13, 2026 | Updated last week
- An extensible system to keep track of boards & commissions details, the people appointed to those groups, any legislation they write, and… ☆17 | Mar 10, 2026 | Updated last week
- Alpha for notify API. Sends emails/sms/printed content on behalf of government. ☆15 | Feb 8, 2016 | Updated 10 years ago
- A contextual news development environment. ☆49 | Dec 19, 2014 | Updated 11 years ago
- An introduction to Python - https://www.digitalgov.gov/event/online-intro-to-python/ ☆10 | Aug 2, 2017 | Updated 8 years ago
- explainer page for everyday-language legal terms ☆14 | Feb 15, 2026 | Updated last month
- A repository for creating and maintaining a geospatial representation of U.S. electrical utilities' service territories ☆19 | Nov 21, 2014 | Updated 11 years ago
- test whether SPDX expressions satisfy licensing criteria ☆11 | Jan 7, 2025 | Updated last year
- A Jekyll plugin to test frontmatter on posts and other documents in a Jekyll site. ☆29 | Mar 24, 2017 | Updated 8 years ago
- A no-frills open data portal built with node, express, and mongodb ☆86 | Apr 17, 2017 | Updated 8 years ago
- ☆12 | Apr 30, 2015 | Updated 10 years ago
- Tracking the tools I've found useful ☆14 | Feb 28, 2017 | Updated 9 years ago
- Extract networks of entities from journalistic reporting ☆49 | Jul 17, 2023 | Updated 2 years ago
- A command line application for validating CSV files ☆11 | Feb 16, 2016 | Updated 10 years ago
- ⚔️ M-x kill-all-the-thing ☠️ ☆10 | Oct 16, 2017 | Updated 8 years ago
- Download files from an Internet Archive collection or item ☆17 | Jun 12, 2014 | Updated 11 years ago
- ☆12 | Jul 15, 2024 | Updated last year
- MOAI, an Open Access Server Platform for Institutional Repositories ☆15 | Apr 21, 2023 | Updated 2 years ago
- Build R Packages using Travis CI Containers ☆16 | Jul 7, 2017 | Updated 8 years ago
- R package to compute and visualize summary trees ☆36 | Jan 13, 2016 | Updated 10 years ago
- JSON with datetime handling ☆13 | May 13, 2017 | Updated 8 years ago
- “Let Me Get That Data For You” catalogs the machine-readable data on a given domain name. [RETIRED] ☆102 | Mar 24, 2015 | Updated 10 years ago
- ☆25 | Aug 20, 2025 | Updated 7 months ago