lmullen / ocr-makefileLinks
A Makefile to run OCR on a batch of PDFs
☆40Updated 9 years ago
Alternatives and similar repositories for ocr-makefile
Users that are interested in ocr-makefile are comparing it to the libraries listed below
Sorting:
- Datasets for Historians☆91Updated 3 weeks ago
- Computational Historical Thinking: With Applications in R☆61Updated 5 years ago
- Elections data from the early American republic☆13Updated 6 years ago
- The (very nearly) simplest possible web notebook using R Markdown☆25Updated 8 years ago
- http://plain-text.co☆71Updated 4 years ago
- Analysis repository for "The Spine of American Law: Digital Text Analysis and U.S. Legal Practice"☆19Updated 4 months ago
- A human name parser☆55Updated 2 years ago
- Repository for D-Lab Working Group Files, Scripts, Wiki, Issues, etc.☆14Updated 10 years ago
- Wrapper for the Hypothes.is API☆19Updated 5 years ago
- ☆12Updated 9 years ago
- ARCHIVED An R client for 'HathiTrust' API☆8Updated 3 years ago
- Patterns in NYT production from 1987 to 2007☆11Updated 7 years ago
- US Presidential Elections since 1789☆74Updated 5 years ago
- An R package for the Wikidata API☆54Updated 4 years ago
- Stuff that goes in ~/.pandoc☆63Updated 6 years ago
- Simple text mining of journal articles from JSTOR's Data for Research service☆72Updated 8 years ago
- A repository for materials and issues related to the Joint Roadmap for Open Science Tools (JROST) itself as a community and project.☆22Updated 6 years ago
- A template for bootstrapping reproducible RMarkdown documents for data journalistic purposes.☆54Updated 2 years ago
- Archive of plain-text versions of US National Security Strategy documents.☆12Updated 8 years ago
- An R package providing WorldCat API communication, functions for validating and normalizing bibliographic codes, translation from call nu…☆24Updated 3 weeks ago
- Datasets and code for "Witch Trials" (Leeson and Russ 2018)☆32Updated 8 years ago
- An API client library for Wikimedia pageview data☆24Updated last year
- ARCHIVED☆12Updated 3 years ago
- CNN Transcripts 2000--2025☆23Updated 3 months ago
- Dataset and generative scripts for 3,200+ US Secretary of State visits (1905-present)☆20Updated 8 years ago
- A Vagrant VM for RStudio Server, used in teaching "Literary Data"☆12Updated 9 years ago
- Data Package Manager for R☆57Updated 8 years ago
- Use Unpaywall with R☆66Updated 10 months ago
- Historical and Contemporary Boundaries of the United States of America☆62Updated 3 years ago
- A work-in-progress guide showing how and why you should learn command-line tools (xsv, csvkit) to work with data☆19Updated 6 years ago