18F / doc_processing_toolkit

Python library to extract text from PDF, and default to OCR when text extraction fails.
60Updated 7 years ago

Alternatives and similar repositories for doc_processing_toolkit:

Users that are interested in doc_processing_toolkit are comparing it to the libraries listed below