jsoma / natural-pdfLinks
A friendly library for working with PDFs
☆35Updated this week
Alternatives and similar repositories for natural-pdf
Users that are interested in natural-pdf are comparing it to the libraries listed below
Sorting:
- semantic search for your spreadsheets☆48Updated this week
- ☆22Updated 6 months ago
- A tutorial on optical character recognition using tesseract, ImageMagick and other open source tools☆69Updated 7 months ago
- Teaching guide for a one-hour hands-on session at an IRE/NICAR conference on using pandas to analyze data.☆23Updated 6 months ago
- A collection of cheat sheets for remembering common commands and tips for data journalism work.☆38Updated last year
- transform a datapoint from a website into a CSV time-series dataset using the wayback machine☆12Updated 2 years ago
- 🎓 Practical beginner-level introductions to using different tools and technologies, with a focus on their application in the newsroom☆82Updated 2 years ago
- ☆21Updated 2 months ago
- Inspect Element is a practitioner's guide to auditing algorithms and data-driven investigations☆36Updated last month
- An extremely fast FEC filing parser written in C☆77Updated 4 months ago
- AI agent for enhancing datasets with information from the internet☆18Updated 3 weeks ago
- The repository for the NICAR 2024 class, SELECT * FROM interesting☆17Updated last year
- Collaborative data collection tool developed by the Associated Press☆109Updated 2 years ago
- ☆15Updated last year
- An easy-to-use point-and-click geocoder 🌍📍☆15Updated 2 years ago
- A general purpose tool for text-based crosswalking☆108Updated last year
- A demonstration of how to build and publish pages with the baker build tool☆21Updated last year
- a general list of resources and articles for people interested in getting into data journalism☆16Updated 2 years ago
- yet another foia automation service☆44Updated 3 years ago
- NTSB / National Transportation Safety Board docket scraper☆12Updated 6 years ago
- A Python client for the Flightradar24 API with CLI support. Fetch, plot and analyze flight data with ease.☆17Updated last month
- Materials to reproduce findings in our stories, "Swinging the Vote?", and "To Gmail, Most Black Lives Matter Emails Are 'Promotions'"☆38Updated last year
- A quick repo with basic command line commands, plus a very brief CSVKit run through.☆16Updated last year
- Docs and info from my 2018 workshop at the CAR conference☆29Updated 7 years ago
- Nicar ML/NLP workshop by J Kao☆19Updated 6 years ago
- Workshops created by the Quartz AI Studio☆48Updated 4 years ago
- A step-by-step guide to publishing a standalone story from a dataset.☆30Updated 5 months ago
- Workbook to teach the concept of risk ratios for data journalism applications☆33Updated 3 years ago
- Code and methodology to produce the dataset in Grist's Misplaced Trust investigation☆16Updated last year
- A simple app to add OAuth-based authentication in front of an S3 bucket-based static website.☆11Updated 2 years ago