pqzx / html2docxLinks
Convert html to docx
☆83Updated last year
Alternatives and similar repositories for html2docx
Users that are interested in html2docx are comparing it to the libraries listed below
Sorting:
- Append/Concatenate .docx documents☆121Updated last year
- A utility to read and write PDFs with Python☆338Updated 3 years ago
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images.☆193Updated this week
- Simplify DOCX files to JSON☆254Updated last year
- A Python tool to help extracting information from structured PDFs.☆422Updated this week
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆225Updated this week
- Python binding to Poppler-cpp pdf library☆113Updated last year
- Convert HTML to docx☆38Updated last year
- Python API for PDF documents☆125Updated last year
- ☆85Updated 6 months ago
- An extendable docx file format parser and converter☆193Updated 5 months ago
- Markdown to Docx converter☆38Updated 4 years ago
- Simple python wrapper to convert HTML to PDF with headless Chrome via selenium☆74Updated 10 months ago
- Mail merge for Office Open XML (docx) files without the need for Microsoft Office Word.☆278Updated last year
- Python zipfile extensions☆139Updated last year
- A light weight, zero dependency, minimal functionality excel read/writer python library☆316Updated last year
- Convert Word documents (.docx files) to HTML☆1,022Updated last month
- Pure-Python full-text search library☆646Updated last year
- A modern CSS selector implementation for BeautifulSoup☆250Updated 2 months ago
- Pure python implementation of identifying files based off their magic numbers☆219Updated 4 months ago
- A package acting as a wrapper around the headless mode of existing web browsers to generate images from URLs and from HTML+CSS strings or…☆435Updated 5 months ago
- A pure python based utility to extract text and images from docx files.☆568Updated 7 months ago
- URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.☆271Updated last year
- Create, read, and modify Excel .xlsx files☆113Updated 5 years ago
- A simple export from xlsx format to html tables with keep cell formatting☆65Updated 10 months ago
- A Python implementation of the JSON5 data format☆262Updated 3 months ago
- A python library to make filling pdfs much easier☆154Updated last year
- A restricted execution environment for Python to run untrusted code.☆654Updated 3 weeks ago
- PyScreeze is a simple, cross-platform screenshot module for Python 2 and 3.☆216Updated last year
- Demos, examples and utilities using PyMuPDF☆687Updated last year