carlosplanchon / betterhtmlchunking
View external linksLinks

BetterHTMLChunking is a Python library for intelligent HTML segmentation. It builds a DOM tree from raw HTML and extracts content-rich regions of interest, making content analysis effortless. Great for LLM based processing.
49Jan 30, 2026Updated 2 weeks ago

Alternatives and similar repositories for betterhtmlchunking

Users that are interested in betterhtmlchunking are comparing it to the libraries listed below

Sorting:

Are these results useful?