booktype / python-ooxml
Python library for parsing .docx (Office Open XML) files
☆51Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for python-ooxml
- Python library for manipulating Open Packaging Convention (OPC) files like .docx, .pptx, and .xslx☆42Updated 7 years ago
- Create, read, and modify Excel .xlsx files☆103Updated 4 years ago
- An extendable docx file format parser and converter☆190Updated 4 years ago
- Python binding to libpoppler-qt5☆42Updated last year
- Python bindings for CHMLIB☆55Updated last year
- Fast Indexed python HTML parser which builds a DOM node tree, providing common getElementsBy* functions for scraping, testing, modificati…☆100Updated last year
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆252Updated 9 months ago
- Convert html to docx☆74Updated 4 months ago
- Python Domain Specific Language Tools☆83Updated 2 years ago
- Fast multi-keyword search engine for text strings☆247Updated 2 months ago
- Python 3 port of pdfminer☆189Updated 6 years ago
- Pure python Aho-Corasick library.☆212Updated last year
- A fast, pure-Python, untyped, in-memory database engine, using Python syntax to manage data, instead of SQL, inspired by PyDbLite.☆20Updated 7 years ago
- ☆60Updated 5 years ago
- Un/packs an MHT (MHTML) archive into/from separate files, writing/reading them in directories to match their Content-Location.☆80Updated 2 years ago
- Python to JavaScript translator☆92Updated 7 years ago
- An efficient simhash implementation for python☆125Updated 5 years ago
- Python extension module for accelerating regular expressions using libesm☆132Updated last year
- A simple parser for the python difflib ndiff that returns objects representing diff between two filew☆30Updated 5 years ago
- A simple python wrapper for PDFium.☆15Updated 2 years ago
- Python module for JSON data encoding, including jsonlint. See the project Wiki here on Github. Also read the README at the bottom of th…☆301Updated 4 years ago
- 🐍 A CPython extension for the Hyperscan regular expression matching library.☆168Updated this week
- PyOO allows you to control a running OpenOffice or LibreOffice program for reading and writing spreadsheet documents☆101Updated 5 years ago
- Command-line tool for exploring and diagnosing problems with Microsoft Office Open XML files (.docx, .pptx, .xlsx)☆47Updated last month
- 大规模中文语料☆38Updated 5 years ago
- ☆72Updated 2 years ago
- Chrome Debugging client for Python☆32Updated 5 years ago
- proxy server in python with upstream support☆49Updated 10 years ago
- Utilities for working with Excel files that require both xlrd and xlwt.☆273Updated 5 years ago
- tornado-crontab is a library that can make the task apps like crontab.☆28Updated last year