datamade / pdf-textextractLinks
Docker Container for a Make-based, PDF extraction using OCR
☆13Updated last year
Alternatives and similar repositories for pdf-textextract
Users that are interested in pdf-textextract are comparing it to the libraries listed below
Sorting:
- yet another foia automation service☆43Updated 3 years ago
- semantic search for your spreadsheets☆56Updated this week
- Scrapes municipal data from Legistar websites☆47Updated last week
- A collection of cheat sheets for remembering common commands and tips for data journalism work.☆38Updated 2 years ago
- JSON to geocode list of addresses in OpenRefine, using HERE and OpenStreetMap Nominatim APIs☆30Updated 8 months ago
- POLITICO's system for managing civic data☆20Updated 2 years ago
- Collaborative data collection tool developed by the Associated Press☆109Updated 2 years ago
- Combine U.S. census data responsibly☆45Updated 2 years ago
- 🎓 Practical beginner-level introductions to using different tools and technologies, with a focus on their application in the newsroom☆82Updated 3 years ago
- Docs and info from my 2018 workshop at the CAR conference☆29Updated 7 years ago
- Nicar ML/NLP workshop by J Kao☆19Updated 6 years ago
- A demo project and template repository showing how I use SpatiaLite with Datasette for quick spatial analysis.☆16Updated last year
- a python parser for the .fec file format☆46Updated 5 months ago
- A simple app to add OAuth-based authentication in front of an S3 bucket-based static website.☆11Updated 2 years ago
- An easy-to-use point-and-click geocoder 🌍📍☆15Updated 2 years ago
- A new version of the cook county jail scraper, inspired by the Supreme Chi-Town Coding Crew☆23Updated 2 years ago
- Learning text classification for journalists through DocHate tips☆10Updated 5 years ago
- ☆12Updated 6 years ago
- A tutorial on optical character recognition using tesseract, ImageMagick and other open source tools☆69Updated 8 months ago
- 📒 Analyzing Data, the DataMade Way☆37Updated 4 years ago
- Teaching guide for a one-hour hands-on session at an IRE/NICAR conference on using pandas to analyze data.☆23Updated 7 months ago
- Project generator for use with the datakit framework.☆28Updated last year
- transform a datapoint from a website into a CSV time-series dataset using the wayback machine☆12Updated 2 years ago
- A place to collect scripts that help journalists do their jobs☆31Updated 8 years ago
- GIS data for the U.S.-Mexico border fence (perhaps a wall in the future)☆28Updated 8 years ago
- this is the code that goes along with the AJC story at https://www.ajc.com/news/state--regional-govt--politics/precinct-closures-harm-vot…☆13Updated 5 years ago
- NYC 311 complaints and demographic analysis☆42Updated 7 years ago
- Notes from the sessions I attended at NICAR 2018 in Chicago, Ill.☆13Updated 6 years ago
- A Python wrapper for the OpenFEC API.☆28Updated 5 years ago
- Voting Precinct Shapefiles in the United States☆100Updated 7 years ago