ldenoue / pdftojson

using XPDF, pdftojson extracts text from PDF files as JSON, including word bounding boxes.
142Updated last year

Related projects

Alternatives and complementary repositories for pdftojson