18F / doc_processing_toolkit

Python library to extract text from PDF, and default to OCR when text extraction fails.
60Updated 7 years ago

Related projects

Alternatives and complementary repositories for doc_processing_toolkit