Python library that extracts text from PDFs and falls back to OCR when text extraction fails.
☆62 · Oct 6, 2017 · Updated 8 years ago
Alternatives and similar repositories for doc_processing_toolkit
Users that are interested in doc_processing_toolkit are comparing it to the libraries listed below.
- Crawl a site, run pa11y on every HTML page, and get the results ☆18 · Sep 27, 2016 · Updated 9 years ago
- We use Tock to track and report our time at 18F ☆124 · Nov 6, 2025 · Updated 7 months ago
- [DEPRECATED] Run cron jobs in a Cloud Foundry app. ☆13 · Sep 6, 2017 · Updated 8 years ago
- A complete agency API program. ☆12 · Apr 27, 2017 · Updated 9 years ago
- [DEPRECATED] Hubot script using the Slack Real Time Messaging and Web APIs to file GitHub issues ☆16 · Feb 2, 2017 · Updated 9 years ago
- make it easy to turn a lot of potentially large csv files into easily accessible open data ☆197 · Nov 2, 2016 · Updated 9 years ago
- A lightweight pipeline, locally or in Lambda, for scanning things like HTTPS, third party service use, and web accessibility. ☆388 · Aug 6, 2021 · Updated 4 years ago
- Turns legal citations in the DOM into links ☆20 · Mar 15, 2017 · Updated 9 years ago
- A scaffold/generator to standardize 18F project setup ☆26 · Sep 9, 2019 · Updated 6 years ago
- Sharing a viewer we built for WNYC. ☆12 · May 10, 2011 · Updated 15 years ago
- a Jekyll Plugin that generates a JSON file with data for all the Pages in your Site ☆44 · Aug 28, 2016 · Updated 9 years ago
- CLI downloading for google documents ☆14 · Oct 27, 2015 · Updated 10 years ago
- Cloud Application Registry ☆16 · Nov 7, 2017 · Updated 8 years ago
- a quick python helper that generates a big.js presentation ☆27 · Nov 1, 2024 · Updated last year
- ☆17 · Apr 5, 2016 · Updated 10 years ago
- Ruby access to the SAM.gov API ☆13 · Mar 25, 2017 · Updated 9 years ago
- OpenControl content for Red Hat technologies ☆16 · Jan 20, 2020 · Updated 6 years ago
- Allow anyone with a modern browser to stream a 1GB, 10GB, 100GB, or 1TB file over the Internet and into a happy home. ☆32 · Oct 7, 2018 · Updated 7 years ago
- Cli interface to threatcrowd.org ☆21 · Jul 6, 2017 · Updated 8 years ago
- hubot plugin: (query) interface to google spreadsheet (*with* authentication) ☆13 · May 28, 2020 · Updated 6 years ago
- A Slack bot to welcome new 18F hires with the authority and compassion of Mrs. Landingham