Goldziher / kreuzbergLinks

Document intelligence framework for Python - Extract text, metadata, and structured data from PDFs, images, Office documents, and more. Built on Pandoc, PDFium, and Tesseract.
2,080Updated last week

Alternatives and similar repositories for kreuzberg

Users that are interested in kreuzberg are comparing it to the libraries listed below

Sorting: