StabRise / spark-pdf

PDF DataSource for Apache Spark. It allows reading PDF files directly into a DataFrame and applying OCR to them.
48 · Updated this week

Alternatives and similar repositories for spark-pdf:

Users who are interested in spark-pdf are comparing it to the libraries listed below.