CenterForOpenScience / pydocxLinks
An extendable docx file format parser and converter
☆192Updated last month
Alternatives and similar repositories for pydocx
Users that are interested in pydocx are comparing it to the libraries listed below
Sorting:
- Python wrapper for Pandoc—the universal document converter.☆215Updated 9 years ago
- A library for extracting tables from PDF files☆90Updated 4 years ago
- Python package for Google's diff-match-patch native C++ implementation.☆79Updated last year
- Convert Word documents (.docx files) to HTML☆961Updated 2 weeks ago
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.☆54Updated 5 months ago
- Python module to drive the awesome pdftk binary.☆149Updated 2 years ago
- PyOO allows you to control a running OpenOffice or LibreOffice program for reading and writing spreadsheet documents☆103Updated 6 years ago
- A utility to read and write pdfs with Python. Superseded: see https://github.com/knowah/PyPDF2☆90Updated 8 years ago
- Python CFFI wrapper for LibreOfficeKit☆56Updated 5 years ago
- Wraps any WSGI application and makes it easy to send test requests to that application, without starting up an HTTP server.☆340Updated 2 weeks ago
- Python binding to libpoppler-qt5☆43Updated last year
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors.☆52Updated 4 years ago
- Convert html to docx☆81Updated 11 months ago
- A simple, immutable URL class with a clean API for interrogation and manipulation.☆293Updated last year
- Create, read, and modify Excel .xlsx files☆111Updated 4 years ago
- Offering FullText Search of MySQL in SQLAlchemy☆91Updated 3 years ago
- Utilities for using XPath to map XML data to Python objects and Django forms☆39Updated 3 years ago
- Generates index page like mod_autoindex☆113Updated last year
- SimpleSQLite is a Python library to simplify SQLite database operations: table creation, data insertion and get data as other data format…☆133Updated 3 months ago
- Reads, queries and modifies Microsoft Word 2007/2008 docx files.☆1,072Updated 9 years ago
- Command-line tool for exploring and diagnosing problems with Microsoft Office Open XML files (.docx, .pptx, .xlsx)☆53Updated 8 months ago
- CSS Selectors for Python☆298Updated last month
- a Python implementation of the Unicode Collation Algorithm☆220Updated last year
- PyTime is an easy-use Python module which aims to operate date/time/datetime by string.☆159Updated 2 years ago
- An extension for the Flask microframework that adds Sijax support.☆107Updated 10 years ago
- A wrapper library to read, manipulate and write data in xlsx and xlsm format using openpyxl☆118Updated last month
- Backport of Python 3's csv module for Python 2☆64Updated 4 years ago
- Python powered spreadsheets☆172Updated 6 years ago
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆305Updated last year
- An application server based on the Pyramid web framework (http://substanced.net)☆158Updated this week