Docker container for Make-based PDF text extraction using OCR
☆14 Jul 31, 2024 Updated last year
Alternatives and similar repositories for pdf-textextract
Users that are interested in pdf-textextract are comparing it to the libraries listed below
- A Django application to archive real-time earthquake notifications from the USGS's Advanced National Seismic System ☆14 Jan 11, 2024 Updated 2 years ago
- A Python package for extracting data and metadata from Esri REST API endpoints. It provides an API, CLI and web app for exporting feature… ☆60 Mar 7, 2026 Updated 2 weeks ago
- Web Application that uses computer vision to teach you how to dance! (HackTheNorth Winner) ☆10 Sep 20, 2021 Updated 4 years ago
- Source for state legislative district map tiles for openstates.org ☆24 Mar 12, 2026 Updated last week
- A content-filtering bypass system developed specifically to allow access to trans-related resources on public networks (libraries, school… ☆27 Nov 15, 2014 Updated 11 years ago
- A repository of code to scrape, clean, and update daily DHS data on people in shelters in NYC ☆12 Updated this week
- Easily install Python, pipenv and Pipfile packages in your GitHub Action ☆18 Jun 21, 2024 Updated last year
- Connecting Conference Organizers and Speakers since 201x ☆11 Sep 16, 2016 Updated 9 years ago
- React/Redux Chartwerk editor. ☆10 Oct 5, 2018 Updated 7 years ago
- Easily download U.S. census maps ☆35 Feb 23, 2023 Updated 3 years ago
- Combine U.S. census data responsibly ☆46 Feb 23, 2023 Updated 3 years ago
- Functional Data Engineering tutorial in Python & Airflow. ☆17 Mar 24, 2023 Updated 2 years ago
- Developers documentation ☆21 Dec 1, 2025 Updated 3 months ago
- Here I'll work on a few Django based web projects for 💯 days continuously from 14-February-2018 to 24-May-2018. Django is a Python-based… ☆14 Dec 8, 2022 Updated 3 years ago
- Data and code behind Planet Money "Modal American" episode. ☆13 Sep 3, 2019 Updated 6 years ago
- transform a datapoint from a website into a CSV time-series dataset using the wayback machine ☆12 May 24, 2023 Updated 2 years ago
- https://www.washingtonpost.com/national/how-trump-is-changing-the-face-of-legal-immigration/2018/07/02/477c78b2-65da-11e8-99d2-0d678ec08c… ☆16 Jul 2, 2018 Updated 7 years ago
- Leaflet maps auto-generated from Google Docs! ☆13 Sep 9, 2017 Updated 8 years ago
- A simple app to add OAuth-based authentication in front of an S3 bucket-based static website. ☆11 Dec 8, 2022 Updated 3 years ago
- A Flask app to document and test Slack's interactive messages. ☆10 Mar 19, 2021 Updated 5 years ago
- ☆25 Mar 18, 2013 Updated 13 years ago
- The open-source web scrapers that feed the Los Angeles Times California coronavirus tracker. ☆59 May 19, 2025 Updated 10 months ago
- Python wrapper around Washington State Legislative Web Services ☆10 Apr 19, 2019 Updated 6 years ago
- State of the Unions for the rest of us ☆19 Jan 16, 2015 Updated 11 years ago
- ☆10 Mar 10, 2019 Updated 7 years ago
- Puppeteer-first Super Fast Testing ☆12 Apr 26, 2023 Updated 2 years ago
- Git-like data versioning. ☆41 Aug 27, 2023 Updated 2 years ago
- Code base for the Woke Windows Project ☆17 Aug 7, 2025 Updated 7 months ago
- A structured, open-source taxonomy for classifying open source software projects. ☆31 Updated this week
- ☆19 Oct 11, 2017 Updated 8 years ago
- Monoidal data processes. ☆33 Jan 7, 2023 Updated 3 years ago
- Datasette of earning call transcripts from the Motley Fool ☆15 Apr 2, 2023 Updated 2 years ago
- Notebook and companion R script for the "R Basics: Stats" session at NICAR 2016. ☆11 Mar 13, 2016 Updated 10 years ago
- Learning text classification for journalists through DocHate tips ☆10 May 13, 2020 Updated 5 years ago
- Which elected officials of the Republic (deputies, ministers, mayors) still use x.com? ☆14 Feb 8, 2026 Updated last month
- Execute OpenRefine JSON scripts without OpenRefine (or Java) ☆32 Dec 27, 2022 Updated 3 years ago
- ☆27 Nov 24, 2025 Updated 3 months ago
- ☆12 Mar 9, 2019 Updated 7 years ago
- CLI for parsing FEC files, for federal campaign finance pipelines ☆23 Mar 9, 2026 Updated 2 weeks ago