booktype / python-ooxml
Python library for parsing .docx (Office Open XML) files
☆53Updated 5 years ago
Alternatives and similar repositories for python-ooxml
Users that are interested in python-ooxml are comparing it to the libraries listed below
- Python 3 port of pdfminer☆187Updated 7 years ago
- An extendable docx file format parser and converter☆195Updated 8 months ago
- Text (source code) search engine with indexer and a front end web interface to search. Uses Python 3.☆126Updated 2 years ago
- Create, read, and modify Excel .xlsx files☆114Updated 5 years ago
- Fast Indexed python HTML parser which builds a DOM node tree, providing common getElementsBy* functions for scraping, testing, modificati…☆102Updated 2 years ago
- Python library for manipulating Open Packaging Convention (OPC) files like .docx, .pptx, and .xlsx☆47Updated 8 years ago
- Convert a docx (OOXML) file to html. This project is deprecated in favor of https://github.com/OpenScienceFramework/pydocx☆47Updated 11 years ago
- Python CFFI wrapper for LibreOfficeKit☆56Updated 5 years ago
- Python module for JSON data encoding, including jsonlint. See the project Wiki here on Github. Also read the README at the bottom of th…☆306Updated 5 years ago
- Python binding to libpoppler-qt5☆43Updated 2 years ago
- Preprocessed dictionaries that can be used directly with jieba☆18Updated 5 years ago
- Pure python Aho-Corasick library.☆220Updated 3 weeks ago
- ☆45Updated 3 weeks ago
- Constants used in Chinese text processing☆386Updated last year
- Personal clone of Poppler, official repository is here: https://gitlab.freedesktop.org/poppler/poppler☆131Updated 7 years ago
- Python bindings for SQLCipher☆136Updated 3 years ago
- A fast, pure-Python, untyped, in-memory database engine, using Python syntax to manage data, instead of SQL, inspired by PyDbLite.☆20Updated 8 years ago
- Web interface for Advanced Python Scheduler☆53Updated 13 years ago
- A simple GUI for python's difflib to compare files and directories☆137Updated 5 years ago
- ☆61Updated 6 years ago
- A readability parser which can extract title, content, images from html pages☆85Updated 5 years ago
- A wrapper library to read, manipulate and write data in xlsx and xlsm format using openpyxl☆120Updated 9 months ago
- Structure-aware diff for html and xml documents☆89Updated 6 years ago
- Python tool for converting a JSON-style dictionary element to a XML document.☆22Updated 12 years ago
- Utilities for working with Excel files that require both xlrd and xlwt.☆272Updated 6 years ago
- cuobiezi http api☆54Updated 6 years ago
- Python library for extracting text from various file formats (for indexing).☆114Updated 4 years ago
- Some useful utility functions☆76Updated 3 years ago
- Un/packs an MHT (MHTML) archive into/from separate files, writing/reading them in directories to match their Content-Location.☆79Updated 3 years ago
- Fast JavaScript parser for Python.☆257Updated 3 years ago