booktype / python-ooxmlLinks
Python library for parsing .docx (Office Open XML) files
☆51Updated 5 years ago
Alternatives and similar repositories for python-ooxml
Users that are interested in python-ooxml are comparing it to the libraries listed below
Sorting:
- Python 3 port of pdfminer☆187Updated 6 years ago
- Create, read, and modify Excel .xlsx files☆112Updated 4 years ago
- An extendable docx file format parser and converter☆192Updated 3 months ago
- Text (source code) search engine with indexer and a front end web interface to search. Uses Python 3.☆126Updated 2 years ago
- Fast Indexed python HTML parser which builds a DOM node tree, providing common getElementsBy* functions for scraping, testing, modificati…☆102Updated 2 years ago
- Fast multi-keyword search engine for text strings☆256Updated 11 months ago
- A fast, pure-Python, untyped, in-memory database engine, using Python syntax to manage data, instead of SQL, inspired by PyDbLite.☆20Updated 7 years ago
- Python library for manipulating Open Packaging Convention (OPC) files like .docx, .pptx, and .xslx☆46Updated 8 years ago
- A python module to generate xls/x files from a xls/x template.☆81Updated last year
- Python extension module for accelerating regular expressions using libesm☆132Updated last year
- Python bindings for CHMLIB☆58Updated 2 months ago
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆315Updated last year
- A readability parser which can extract title, content, images from html pages☆87Updated 5 years ago
- Python bindings for SQLCipher☆135Updated 3 years ago
- Convert a docx (OOXML) file to html. This project is deprecated in favor of https://github.com/OpenScienceFramework/pydocx☆46Updated 11 years ago
- A utility to read and write PDFs with Python☆337Updated 3 years ago
- Pure python Aho-Corasick library.☆217Updated 2 years ago
- A simple python wrapper for PDFium.☆17Updated 3 years ago
- Python to JavaScript translator☆92Updated 8 years ago
- Python library for extracting text from various file formats (for indexing).☆114Updated 3 years ago
- Phantompy is a headless WebKit engine with powerful pythonic api build on top of Qt5 Webkit☆613Updated 8 years ago
- ☆152Updated 9 years ago
- ☆41Updated last week
- web intrface for Advanced Python Scheduler☆53Updated 13 years ago
- 对微信网页授权获取用户信息的封装☆10Updated 10 years ago
- Personal clone of Poppler, official repository is here: https://gitlab.freedesktop.org/poppler/poppler☆129Updated 7 years ago
- Convert html to docx☆82Updated last year
- Python 3 bindings for SQLCipher☆150Updated last year
- A utility to read and write pdfs with Python. Superseded: see https://github.com/knowah/PyPDF2☆92Updated 9 years ago
- Thin Python wrapper of https://bellard.org/quickjs/☆199Updated last week