pqzx / html2docx
Convert html to docx
☆86 Updated last year
Alternatives and similar repositories for html2docx
Users who are interested in html2docx are comparing it to the libraries listed below.
- Convert HTML to docx ☆38 Updated last year
- A Python tool to help extracting information from structured PDFs. ☆427 Updated last week
- Simplify DOCX files to JSON ☆257 Updated last year
- Append/Concatenate .docx documents ☆125 Updated last year
- Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images. ☆197 Updated this week
- Python binding to Poppler-cpp pdf library ☆114 Updated last year
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python. ☆226 Updated last week
- Python API for PDF documents ☆124 Updated last year
- A utility to read and write PDFs with Python ☆338 Updated 4 years ago
- Python zipfile extensions ☆139 Updated last year
- Simple python wrapper to convert HTML to PDF with headless Chrome via selenium ☆74 Updated last week
- Pure-Python full-text search library ☆650 Updated last year
- ☆85 Updated 7 months ago
- A pure python based utility to extract text and images from docx files. ☆575 Updated 9 months ago
- A modern CSS selector implementation for BeautifulSoup ☆260 Updated last week
- Thin wrapper for "pandoc" (MIT) ☆1,086 Updated 3 weeks ago
- Markdown to Docx converter ☆38 Updated 4 years ago
- Pure python implementation of identifying files based off their magic numbers ☆225 Updated last week
- An extendable docx file format parser and converter ☆194 Updated 7 months ago
- mirror of https://hg.reportlab.com/hg-public/reportlab ☆77 Updated last week
- A Python implementation of the JSON5 data format ☆262 Updated this week
- A utility to read and write PDFs with Python ☆74 Updated last year
- Python client for Typesense: https://github.com/typesense/typesense ☆231 Updated last month
- ☆585 Updated 2 months ago
- URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD. ☆272 Updated last year
- Convert html to docx ☆54 Updated last week
- A package acting as a wrapper around the headless mode of existing web browsers to generate images from URLs and from HTML+CSS strings or… ☆444 Updated 6 months ago
- Convert Word documents (.docx files) to HTML ☆1,040 Updated last month
- XPath 1.0/2.0/3.0/3.1 parsers and selectors for ElementTree and lxml ☆86 Updated 2 months ago
- Parse numbers written in natural language ☆124 Updated last year