booktype / python-ooxmlLinks
Python library for parsing .docx (Office Open XML) files
☆52Updated 5 years ago
Alternatives and similar repositories for python-ooxml
Users that are interested in python-ooxml are comparing it to the libraries listed below
Sorting:
- Python 3 port of pdfminer☆187Updated 7 years ago
- Text (source code) search engine with indexer and a front end web interface to search. Uses Python 3.☆126Updated 2 years ago
- Create, read, and modify Excel .xlsx files☆113Updated 5 years ago
- bamboo是一个中文语言处理系统。☆14Updated 14 years ago
- Python bindings for CHMLIB☆57Updated 5 months ago
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆325Updated last year
- An extendable docx file format parser and converter☆192Updated 5 months ago
- Python bindings for SQLCipher☆135Updated 3 years ago
- ☆42Updated 2 months ago
- A pure python based utility to extract text and images from docx files.☆566Updated 7 months ago
- A wrapper library to read, manipulate and write data in xlsx and xlsm format using openpyxl☆119Updated 6 months ago
- PyV8 is a python wrapper for Google V8 engine, it act as a bridge between the Python and JavaScript objects, and support to hosting Googl…☆59Updated 13 years ago
- A fast, pure-Python, untyped, in-memory database engine, using Python syntax to manage data, instead of SQL, inspired by PyDbLite.☆20Updated 8 years ago
- Constants used in Chinese text processing☆378Updated 10 months ago
- Fast Indexed python HTML parser which builds a DOM node tree, providing common getElementsBy* functions for scraping, testing, modificati…☆102Updated 2 years ago
- Python workflow engine☆66Updated 4 years ago
- Convert a docx (OOXML) file to html. This project is deprecated in favor of https://github.com/OpenScienceFramework/pydocx☆47Updated 11 years ago
- Python to JavaScript translator☆92Updated 8 years ago
- A readability parser which can extract title, content, images from html pages☆87Updated 5 years ago
- Utilities for working with Excel files that require both xlrd and xlwt.☆272Updated 6 years ago
- Python library for manipulating Open Packaging Convention (OPC) files like .docx, .pptx, and .xslx☆47Updated 8 years ago
- Python/JavaScript bridge module, making use of Mozilla's spidermonkey JavaScript implementation.☆304Updated 8 years ago
- Python module for JSON data encoding, including jsonlint. See the project Wiki here on Github. Also read the README at the bottom of th…☆306Updated 5 years ago
- Chrome Debugging client for Python☆33Updated 6 years ago
- A utility to read and write PDFs with Python☆338Updated 3 years ago
- Python bindings for WPS Office RPC (for Linux)☆268Updated 7 months ago
- A Python package to enable Unicode support when running Python from Windows console.☆102Updated 4 years ago
- Structure-aware diff for html and xml documents☆89Updated 5 years ago
- Convert Word documents (.docx files) to HTML☆1,020Updated last month
- Pure python Aho-Corasick library.☆220Updated 2 years ago