18F / doc_processing_toolkit
Python library to extract text from PDF, and default to OCR when text extraction fails.
☆60Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for doc_processing_toolkit
- A basic spreadsheet to api engine☆42Updated 5 years ago
- A Python web application for converting PDF forms into PDF-filling APIs☆46Updated 3 years ago
- Please check out https://github.com/18F/foia-hub/issues to track our work. This repo is for project wide discussion, blogging, and scratc…☆51Updated 6 years ago
- We use Tock to track and report our time at 18F☆120Updated last week
- Turns legal citations in the DOM into links☆20Updated 7 years ago
- Importer for US Spending data☆33Updated 10 years ago
- ☆36Updated 7 years ago
- framework for scraping legislative/government data☆85Updated 2 months ago
- A complete agency API program.☆12Updated 7 years ago
- A deprecated Python wrapper for the DocumentCloud API☆63Updated 3 years ago
- “Let Me Get That Data For You” catalogs the machine-readable data on a given domain name. [RETIRED]☆102Updated 9 years ago
- ReVAL: Reusable Validation Library - A Django App for validating data via API and web interface☆32Updated 3 years ago
- Monitor datasets, gets alerts when something happens☆211Updated 5 years ago
- Another home for the Sunlight Foundation's Open Data Policy research.☆98Updated 7 years ago
- Slides for 18F - built automatically using Federalist☆30Updated 6 years ago
- Website to test data validation and submission process☆22Updated last week
- Friendly Slack bot for looking up cases☆20Updated 6 years ago
- Collecting reports from Inspectors General across the US federal government.☆107Updated 3 years ago
- Scrapers for US municipal governments.☆101Updated 5 months ago
- legacy backend for Open States☆87Updated 4 years ago
- Easily crowdsource the analysis of your documents☆102Updated 7 years ago
- A step-by-step guide to publishing a simple news application.☆76Updated 6 years ago
- Comport is a tool for law enforcement agencies to open their data and be accountable to their citizens.☆23Updated 5 years ago
- OSSSM (awesome). Open source short-term secure messaging☆110Updated last year
- Python workers that collect tweets from the twitter streaming api and track deletions☆121Updated last year
- Manage and display public record requests, built by the Code for America 2013 Oakland team, maintained by @richaagarwal☆60Updated 3 years ago
- [No longer supported] Look at the new open source guidance repo☆9Updated 7 years ago
- A project focused on tools and best practices to supported federated data collection efforts☆28Updated 4 years ago
- A subscription service for city council legislative information, started in Philadelphia.☆58Updated 9 years ago