18F / doc_processing_toolkit
Python library to extract text from PDF, and default to OCR when text extraction fails.
☆62 · Updated 7 years ago
Alternatives and similar repositories for doc_processing_toolkit
Users who are interested in doc_processing_toolkit are comparing it to the libraries listed below.
- A basic spreadsheet to api engine ☆42 · Updated 5 years ago
- Please check out https://github.com/18F/foia-hub/issues to track our work. This repo is for project wide discussion, blogging, and scratc… ☆51 · Updated 7 years ago
- Turns legal citations in the DOM into links ☆20 · Updated 8 years ago
- This small DATA Act pilot contains code that translates agency data to a uniform DATA act format. ☆21 · Updated 9 years ago
- Easily crowdsource the analysis of your documents ☆102 · Updated 7 years ago
- A Python web application for converting PDF forms into PDF-filling APIs ☆46 · Updated 4 years ago
- We use Tock to track and report our time at 18F ☆124 · Updated 2 weeks ago
- ReVAL: Reusable Validation Library - A Django App for validating data via API and web interface ☆32 · Updated 3 years ago
- Collecting reports from Inspectors General across the US federal government. ☆109 · Updated 4 years ago
- Friendly Slack bot for looking up cases ☆21 · Updated 7 years ago
- framework for scraping legislative/government data ☆85 · Updated 8 months ago
- Inter-agency Federal AI Personal Assistant Pilot ☆45 · Updated 8 years ago
- ☆36 · Updated 7 years ago
- Comport is a tool for law enforcement agencies to open their data and be accountable to their citizens. ☆23 · Updated 6 years ago
- (DEPRECATED) Parser for U.S. federal regulations and other regulatory information ☆55 · Updated 6 years ago
- A complete agency API program. ☆12 · Updated 8 years ago
- A toolkit for mapping networks of political and economic influence through diverse types of entities and their relations. Accessible at h… ☆188 · Updated 4 years ago
- legacy backend for Open States ☆87 · Updated 5 years ago
- A consolidated FOIA request hub. ☆50 · Updated 6 years ago
- Importer for US Spending data ☆33 · Updated 10 years ago
- A subscription service for city council legislative information, started in Philadelphia. ☆58 · Updated 9 years ago
- Website to test data validation and submission process ☆21 · Updated this week
- Automate your FOIAs. The real, production version. ☆48 · Updated 7 years ago
- Suggestions, schedules, and other information about the Engineering Chapter's Tech Talk meetings. ☆28 · Updated last year
- Unified Python bindings for Sunlight APIs ☆66 · Updated 9 years ago
- A place to collect ideas for CfA health projects ☆41 · Updated 9 years ago
- Parser and standardizer for politician, individual and organization names. ☆129 · Updated 8 years ago
- “Let Me Get That Data For You” catalogs the machine-readable data on a given domain name. [RETIRED] ☆102 · Updated 10 years ago
- A step-by-step guide to publishing a simple news application. ☆76 · Updated 7 years ago
- "Old SFM" -- manage rules and streams from social data sources, starting with twitter. ☆86 · Updated last year