pqzx / html2docxLinks
Convert html to docx
☆84Updated last year
Alternatives and similar repositories for html2docx
Users that are interested in html2docx are comparing it to the libraries listed below
Sorting:
- Append/Concatenate .docx documents☆123Updated last year
- Convert HTML to docx☆38Updated last year
- ☆85Updated 6 months ago
- A Python tool to help extracting information from structured PDFs.☆425Updated last week
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆226Updated 2 weeks ago
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images.☆195Updated last week
- Python binding to Poppler-cpp pdf library☆114Updated last year
- A utility to read and write PDFs with Python☆338Updated 4 years ago
- Pure-Python full-text search library☆649Updated last year
- Simple python wrapper to convert HTML to PDF with headless Chrome via selenium☆74Updated 11 months ago
- A python library to make filling pdfs much easier☆154Updated last year
- Pure python implementation of identifying files based off their magic numbers☆221Updated 5 months ago
- Convert html to docx☆52Updated last week
- A modern CSS selector implementation for BeautifulSoup☆252Updated 3 months ago
- Simplify DOCX files to JSON☆257Updated last year
- Python API for PDF documents☆125Updated last year
- mirror of https://hg.reportlab.com/hg-public/reportlab☆76Updated 3 weeks ago
- rstr is a helper module for easily generating random strings of various types. It could be useful for fuzz testing, generating dummy data…☆98Updated 9 months ago
- Truly universal encoding detector in pure Python.☆721Updated this week
- A light weight, zero dependency, minimal functionality excel read/writer python library☆315Updated last year
- Convert Word documents (.docx files) to HTML☆1,034Updated 2 weeks ago
- Small, dependency-free, fast Python package to infer binary file types checking the magic numbers signature☆744Updated 7 months ago
- A simple export from xlsx format to html tables with keep cell formatting☆65Updated 10 months ago
- URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.☆271Updated last year
- A restricted execution environment for Python to run untrusted code.☆675Updated last month
- A pure python based utility to extract text and images from docx files.☆573Updated 8 months ago
- Read SVG files and convert them to other formats.☆350Updated last week
- An extendable docx file format parser and converter☆193Updated 6 months ago
- Allowlist-based HTML cleaner☆153Updated 5 months ago
- A Python implementation of the JSON5 data format☆262Updated 3 months ago