jsoma / natural-pdf
A friendly library for working with PDFs
☆40 · Updated 3 weeks ago
Alternatives and similar repositories for natural-pdf
Users who are interested in natural-pdf are comparing it to the libraries listed below.
- semantic search for your spreadsheets ☆54 · Updated last week
- A tutorial on optical character recognition using tesseract, ImageMagick and other open source tools ☆69 · Updated 8 months ago
- ☆22 · Updated 6 months ago
- ☆21 · Updated 3 months ago
- A collection of cheat sheets for remembering common commands and tips for data journalism work. ☆38 · Updated last year
- An extremely fast FEC filing parser written in C ☆78 · Updated 2 weeks ago
- Teaching guide for a one-hour hands-on session at an IRE/NICAR conference on using pandas to analyze data. ☆23 · Updated 7 months ago
- ☆11 · Updated 6 months ago
- ☆15 · Updated last year
- transform a datapoint from a website into a CSV time-series dataset using the wayback machine ☆12 · Updated 2 years ago
- Inspect Element is a practitioner's guide to auditing algorithms and data-driven investigations ☆37 · Updated 2 months ago
- Scrapers for U.S. county court sites. ☆73 · Updated 2 years ago
- Easily download U.S. census maps ☆33 · Updated 2 years ago
- AI agent for enhancing datasets with information from the internet ☆18 · Updated last month
- The repository for the NICAR 2024 class, SELECT * FROM interesting ☆17 · Updated last year
- Workbook to teach the concept of risk ratios for data journalism applications ☆33 · Updated 3 years ago
- A Python client for the Flightradar24 API with CLI support. Fetch, plot and analyze flight data with ease. ☆17 · Updated 2 months ago
- A build tool by and for the Los Angeles Times ☆29 · Updated 4 months ago
- yet another foia automation service ☆44 · Updated 3 years ago
- A general purpose tool for text-based crosswalking ☆108 · Updated last year
- A simple app to add OAuth-based authentication in front of an S3 bucket-based static website. ☆11 · Updated 2 years ago
- Collaborative data collection tool developed by the Associated Press ☆109 · Updated 2 years ago
- A step-by-step guide to publishing a standalone story from a dataset. ☆30 · Updated last week
- A demonstration of how to build and publish pages with the baker build tool ☆21 · Updated last year
- ReproZip for the Preservation of Web Applications ☆17 · Updated last year
- Materials to reproduce findings in our stories, "Swinging the Vote?", and "To Gmail, Most Black Lives Matter Emails Are 'Promotions'" ☆38 · Updated last year
- Teaching guide for a one-hour hands-on session at an IRE/NICAR conference on scraping web data using Python. ☆26 · Updated last year
- Docs and info from my 2018 workshop at the CAR conference ☆29 · Updated 7 years ago
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists. ☆17 · Updated this week
- NTSB / National Transportation Safety Board docket scraper ☆12 · Updated 6 years ago