aeksco / aws-pdf-textract-pipeline

Data pipeline for crawling PDFs from the Web and transforming their contents into structured data using AWS textract. Built with AWS CDK + TypeScript
163Updated 7 months ago

Alternatives and similar repositories for aws-pdf-textract-pipeline:

Users that are interested in aws-pdf-textract-pipeline are comparing it to the libraries listed below