ldenoue / pdftojson

using XPDF, pdftojson extracts text from PDF files as JSON, including word bounding boxes.
143Updated last year

Alternatives and similar repositories for pdftojson:

Users that are interested in pdftojson are comparing it to the libraries listed below