huridocs / pdf-table-of-contents-extractor
View external linksLinks

This project aims to extract Table of Contents (TOC) information from PDF files using the outputs generated by the pdf-document-layout-analysis service. By leveraging the segmentation and classification capabilities of the underlying analysis tool, this project automates the process of identifying and structuring the document's TOC.
20Feb 3, 2025Updated last year

Alternatives and similar repositories for pdf-table-of-contents-extractor

Users that are interested in pdf-table-of-contents-extractor are comparing it to the libraries listed below

Sorting:

Are these results useful?