booktype / python-ooxmlLinks
Python library for parsing .docx (Office Open XML) files
☆51Updated 5 years ago
Alternatives and similar repositories for python-ooxml
Users that are interested in python-ooxml are comparing it to the libraries listed below
Sorting:
- Python 3 port of pdfminer☆187Updated 7 years ago
- An extendable docx file format parser and converter☆192Updated 4 months ago
- Python bindings for CHMLIB☆57Updated 4 months ago
- Text (source code) search engine with indexer and a front end web interface to search. Uses Python 3.☆126Updated 2 years ago
- Python library for manipulating Open Packaging Convention (OPC) files like .docx, .pptx, and .xslx☆47Updated 8 years ago
- Create, read, and modify Excel .xlsx files☆113Updated 5 years ago
- Python to JavaScript translator☆92Updated 8 years ago
- Un/packs an MHT (MHTML) archive into/from separate files, writing/reading them in directories to match their Content-Location.☆79Updated 3 years ago
- ☆60Updated 5 years ago
- Save ranges from Excel documents as images☆108Updated 4 years ago
- ☆42Updated last month
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆322Updated last year
- A fast, pure-Python, untyped, in-memory database engine, using Python syntax to manage data, instead of SQL, inspired by PyDbLite.☆20Updated 8 years ago
- Fast multi-keyword search engine for text strings☆257Updated last year
- Pure python Aho-Corasick library.☆219Updated 2 years ago
- A pure python module which implements the DES and Triple-DES encryption algorithms.☆178Updated 6 years ago
- Find and use proxy auto-config (PAC) files with Python and Requests.☆72Updated last month
- Python bindings for SQLCipher☆135Updated 3 years ago
- Convert Word documents (.docx files) to HTML☆1,007Updated 3 weeks ago
- Constants used in Chinese text processing☆377Updated 10 months ago
- Fast Indexed python HTML parser which builds a DOM node tree, providing common getElementsBy* functions for scraping, testing, modificati…☆102Updated 2 years ago
- a quick and dirty script to convert a Word (docx) document to html.☆53Updated 4 years ago
- Convert html to docx☆83Updated last year
- Pond is a high performance object-pooling library for Python☆56Updated last year
- Convert a docx (OOXML) file to html. This project is deprecated in favor of https://github.com/OpenScienceFramework/pydocx☆46Updated 11 years ago
- A utility to read and write PDFs with Python☆338Updated 3 years ago
- A Python tool to help extracting information from structured PDFs.☆417Updated this week
- Phantompy is a headless WebKit engine with powerful pythonic api build on top of Qt5 Webkit☆612Updated 8 years ago
- Compiled PyV8 for Mac OS X☆101Updated 12 years ago
- A readability parser which can extract title, content, images from html pages☆87Updated 5 years ago