Hyland / DocumentFilters

Document Filters is an SDK for applications like content indexing, e-discovery, data migration, and feeding data into AI/ML models by extracting data from unstructured sources. It gives the ability to perform deep inspection, data extraction, output manipulation, and conversion for virtually any type of document, in any programming language.
19Updated last week

Related projects

Alternatives and complementary repositories for DocumentFilters