aphp / edspdf

EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to use rule- or machine-learning-based approaches to classify text blocs between body and meta-data.
47Updated 3 months ago

Alternatives and similar repositories for edspdf

Users that are interested in edspdf are comparing it to the libraries listed below

Sorting: