Goldziher / kreuzbergLinks

Document intelligence framework for Python - Extract text, metadata, and structured data from PDFs, images, Office documents, and more. Built on Pandoc, PDFium, and Tesseract.
1,965Updated this week

Alternatives and similar repositories for kreuzberg

Users that are interested in kreuzberg are comparing it to the libraries listed below

Sorting: