Python library to extract text from PDF, and default to OCR when text extraction fails.
☆62Oct 6, 2017Updated 8 years ago
Alternatives and similar repositories for doc_processing_toolkit
Users that are interested in doc_processing_toolkit are comparing it to the libraries listed below
Sorting:
- A basic spreadsheet to api engine☆43Aug 27, 2019Updated 6 years ago
- We use Tock to track and report our time at 18F☆124Nov 6, 2025Updated 4 months ago
- [DEPRECATED] Run cron jobs in a Cloud Foundry app.☆13Sep 6, 2017Updated 8 years ago
- A complete agency API program.☆12Apr 27, 2017Updated 8 years ago
- [DEPRECATED] Hubot script using the Slack Real Time Messaging and Web APIs to file GitHub issues☆16Feb 2, 2017Updated 9 years ago
- A lightweight pipeline, locally or in Lambda, for scanning things like HTTPS, third party service use, and web accessibility.☆388Aug 6, 2021Updated 4 years ago
- Turns legal citations in the DOM into links☆20Mar 15, 2017Updated 9 years ago
- A scaffold/generator to standardize 18F project setup☆26Sep 9, 2019Updated 6 years ago
- a Jekyll Plugin that generates a JSON file with data for all the Pages in your Site☆44Aug 28, 2016Updated 9 years ago
- CLI downloading for google documents☆14Oct 27, 2015Updated 10 years ago
- Embeddable forms to recruit research participants. Sends results to a Google Sheet, deployed via Google Tag Manager.☆14Jun 25, 2018Updated 7 years ago
- A simple example using R and D3.js for show the examples of SNA Course in Coursera☆32Jun 23, 2016Updated 9 years ago
- OpenControl content for Red Hat technologies☆16Jan 20, 2020Updated 6 years ago
- Cli interface to threatcrowd.org☆20Jul 6, 2017Updated 8 years ago
- Python client for Sailthru☆29Jan 8, 2026Updated 2 months ago
- A Slack bot to welcome new 18F hires with the authority and compassion of Mrs. Landingham☆188Sep 9, 2019Updated 6 years ago
- 2017 - 2018 Certificate Policy development and drafting for Federal Public Trust Device PKI.☆44Mar 19, 2024Updated 2 years ago
- A project focused on tools and best practices to supported federated data collection efforts☆29May 5, 2020Updated 5 years ago
- Generic RESTful Interface for databases☆36Feb 6, 2014Updated 12 years ago
- get the size of one or more URLs☆17Mar 25, 2015Updated 10 years ago
- A Jekyll template for project documentation☆106Dec 27, 2020Updated 5 years ago
- Public Maltego Transforms☆24May 24, 2017Updated 8 years ago
- How the federal .gov domain space is doing at best practices and policies.☆95Jun 9, 2020Updated 5 years ago
- A small repo of notes and scripts for collecting data on U.S. deadly force police incidents☆10Aug 9, 2015Updated 10 years ago
- pythonic interface to the courtlistener api☆20Oct 30, 2018Updated 7 years ago
- A simple interface for non-technical users — both authenticated and pseudonymous — to provide feedback for your GitHub-hosted project☆57Oct 4, 2022Updated 3 years ago
- Create and manage needs on GOV.UK☆16Aug 7, 2025Updated 7 months ago
- Allow anyone with a modern browser to stream a 1GB, 10GB, 100GB, or 1TB file over the Internet and into a happy home.☆15Jun 9, 2017Updated 8 years ago
- Easily crowdsource the analysis of your documents☆102Nov 7, 2017Updated 8 years ago
- Setup used for the PyCharm webinar☆20Jul 20, 2021Updated 4 years ago
- A simple script to look for and process all the federal data.json data inventories.☆46Mar 10, 2015Updated 11 years ago
- Tool for reading mhtml files and extracting images and text into separate files.☆28Sep 30, 2014Updated 11 years ago
- Dispatch is an application for cities to advertise their contract opportunities.☆11Jan 30, 2018Updated 8 years ago
- A Sam-Packaged AWS Lambda client to the scanii.com content processing service☆26Mar 4, 2026Updated 2 weeks ago
- ☆24Apr 17, 2023Updated 2 years ago
- Passive DNS server interface compliant to "Common Output Format"☆10Sep 19, 2016Updated 9 years ago
- ☆11Sep 29, 2015Updated 10 years ago
- Make templates and then make documents from templates: https://www.youtube.com/watch?v=sKhsy0e0lqk☆11Apr 8, 2015Updated 10 years ago
- Codelab and solution for the Android Things Weatherstation☆15May 26, 2021Updated 4 years ago