booktype / python-ooxmlLinks
Python library for parsing .docx (Office Open XML) files
☆52Updated 5 years ago
Alternatives and similar repositories for python-ooxml
Users that are interested in python-ooxml are comparing it to the libraries listed below
Sorting:
- Python 3 port of pdfminer☆187Updated 7 years ago
- Create, read, and modify Excel .xlsx files☆113Updated 5 years ago
- Python library for manipulating Open Packaging Convention (OPC) files like .docx, .pptx, and .xslx☆47Updated 8 years ago
- Text (source code) search engine with indexer and a front end web interface to search. Uses Python 3.☆126Updated 2 years ago
- Python bindings for CHMLIB☆57Updated 5 months ago
- A wrapper library to read, manipulate and write data in xlsx and xlsm format using openpyxl☆119Updated 6 months ago
- An extendable docx file format parser and converter☆193Updated 6 months ago
- ☆153Updated 9 years ago
- Fast Indexed python HTML parser which builds a DOM node tree, providing common getElementsBy* functions for scraping, testing, modificati…☆102Updated 2 years ago
- A readability parser which can extract title, content, images from html pages☆87Updated 5 years ago
- An auto-reload module for python app.☆12Updated 11 years ago
- A fast, pure-Python, untyped, in-memory database engine, using Python syntax to manage data, instead of SQL, inspired by PyDbLite.☆20Updated 8 years ago
- Fast multi-keyword search engine for text strings☆258Updated last year
- Python to JavaScript translator☆92Updated 8 years ago
- Python module for JSON data encoding, including jsonlint. See the project Wiki here on Github. Also read the README at the bottom of th…☆306Updated 5 years ago
- analyzer adapter for solr 5, we support Jieba, and stranford in the future☆61Updated 7 years ago
- a quick and dirty script to convert a Word (docx) document to html.☆53Updated 4 years ago
- Python binding to libpoppler-qt5☆43Updated 2 years ago
- Phantompy is a headless WebKit engine with powerful pythonic api build on top of Qt5 Webkit☆612Updated 8 years ago
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆327Updated last year
- Convert a docx (OOXML) file to html. This project is deprecated in favor of https://github.com/OpenScienceFramework/pydocx☆47Updated 11 years ago
- Structure-aware diff for html and xml documents☆89Updated 6 years ago
- Un/packs an MHT (MHTML) archive into/from separate files, writing/reading them in directories to match their Content-Location.☆79Updated 3 years ago
- A simple GUI for python's difflib to compare files and directories☆137Updated 4 years ago
- pyregex is a Python Regular Expression Online Tester☆296Updated 4 years ago
- web intrface for Advanced Python Scheduler☆53Updated 13 years ago
- Constants used in Chinese text processing☆378Updated 11 months ago
- Python bindings for SQLCipher☆136Updated 3 years ago
- Language Savant, Python clone of github/linguist.☆153Updated 5 years ago
- Utilities for working with Excel files that require both xlrd and xlwt.☆272Updated 6 years ago