jsoma / natural-pdfLinks
A friendly library for working with PDFs
☆33Updated this week
Alternatives and similar repositories for natural-pdf
Users that are interested in natural-pdf are comparing it to the libraries listed below
Sorting:
- A tutorial on optical character recognition using tesseract, ImageMagick and other open source tools☆69Updated 6 months ago
- semantic search for your spreadsheets☆41Updated 3 weeks ago
- Teaching guide for a one-hour hands-on session at an IRE/NICAR conference on using pandas to analyze data.☆23Updated 5 months ago
- transform a datapoint from a website into a CSV time-series dataset using the wayback machine☆12Updated 2 years ago
- ☆22Updated 5 months ago
- A collection of cheat sheets for remembering common commands and tips for data journalism work.☆38Updated last year
- A simple app to add OAuth-based authentication in front of an S3 bucket-based static website.☆11Updated 2 years ago
- 🎓 Practical beginner-level introductions to using different tools and technologies, with a focus on their application in the newsroom☆82Updated 2 years ago
- Docs and info from my 2018 workshop at the CAR conference☆29Updated 7 years ago
- yet another foia automation service☆44Updated 3 years ago
- Materials for an introductory workshop about Python GIS tools at #NICAR25 in Minneapolis.☆10Updated 5 months ago
- A demonstration of how to build and publish pages with the baker build tool☆21Updated 11 months ago
- Notes for my talk "Exploring the Radio Spectrum for News"☆13Updated 5 years ago
- Inspect Element is a practitioner's guide to auditing algorithms and data-driven investigations☆36Updated 3 weeks ago
- Collaborative data collection tool developed by the Associated Press☆109Updated 2 years ago
- a general list of resources and articles for people interested in getting into data journalism☆16Updated 2 years ago
- this is the code that goes along with the AJC story at https://www.ajc.com/news/state--regional-govt--politics/precinct-closures-harm-vot…☆13Updated 5 years ago
- A Python client for the Flightradar24 API with CLI support. Fetch, plot and analyze flight data with ease.☆17Updated last week
- JSON to geocode list of addresses in OpenRefine, using HERE and OpenStreetMap Nominatim APIs☆30Updated 6 months ago
- Code and methodology to produce the dataset in Grist's Misplaced Trust investigation☆16Updated last year
- How Quartz used AI to help reporters search the Mauritius Leaks☆47Updated 5 years ago
- A build tool by and for the Los Angeles Times☆29Updated 3 months ago
- ☆11Updated 5 months ago
- For students of https://projects.propublica.org/graphics/ida-propublica-data-institute☆26Updated 2 years ago
- The repository for the NICAR 2024 class, SELECT * FROM interesting☆17Updated last year
- Discover and parse results for jurisdictions that use Clarity-based election systems.☆37Updated 2 months ago
- An extremely fast FEC filing parser written in C☆76Updated 3 months ago
- ☆13Updated last year
- A tool for generating the scaffolding needed to create a project the Data Visuals way.☆58Updated 2 months ago
- A repository for collecting several simple datasets that track the impact of the Trump 47 regime☆55Updated last week