CenterForOpenScience / pydocx
An extendable docx file format parser and converter
☆194 · Updated 7 months ago
Alternatives and similar repositories for pydocx
Users who are interested in pydocx are comparing it to the libraries listed below
- A library for extracting tables from PDF files ☆91 · Updated 5 years ago
- Python module to drive the awesome pdftk binary. ☆151 · Updated 2 years ago
- Python wrapper for Pandoc—the universal document converter. ☆215 · Updated 9 years ago
- Python package for Google's diff-match-patch native C++ implementation. ☆85 · Updated last year
- A wrapper library to read, manipulate and write data in xlsx and xlsm format using openpyxl ☆120 · Updated 8 months ago
- Customizable Flask - SQLAlchemy - Whoosh integration ☆85 · Updated last year
- Generate PDF files out of your Flask website thanks to WeasyPrint ☆147 · Updated last year
- PrettyTable is a simple Python library designed to make it quick and easy to represent tabular data in visually appealing ASCII tables. ☆127 · Updated 3 years ago
- PyOO allows you to control a running OpenOffice or LibreOffice program for reading and writing spreadsheet documents ☆104 · Updated 6 years ago
- xmljson converts XML into Python dictionary structures (trees, like in JSON) and vice-versa. ☆124 · Updated 7 months ago
- CSS Selectors for Python ☆305 · Updated last month
- Python library for extracting text from various file formats (for indexing). ☆114 · Updated 3 years ago
- Regular Expression based parsers for extracting data from natural languages ☆71 · Updated 8 years ago
- The simplest way to extract text from PDFs in Python ☆428 · Updated 3 years ago
- SimpleSQLite is a Python library to simplify SQLite database operations: table creation, data insertion and get data as other data format… ☆135 · Updated 2 months ago
- CFFI-based cairo bindings for Python. ☆211 · Updated last month
- Fork of ReportLab http://www.reportlab.com/ftp/reportlab-2.5.tar.gz ☆49 · Updated 2 years ago
- Python 3 port of pdfminer ☆187 · Updated 7 years ago
- A Flask full-text search engine ☆83 · Updated 6 years ago
- Mail merge for Office Open XML (docx) files without the need for Microsoft Office Word. ☆278 · Updated last year
- Python text markup and conversion ☆90 · Updated 5 years ago
- a Python implementation of the Unicode Collation Algorithm ☆223 · Updated last year
- Lightweight data validation and adaptation Python library. ☆262 · Updated 3 years ago
- PythonMagick is a Python binding for the ImageMagick Magick++ library, enabling image creation, editing, and conversion directly in Pytho… ☆65 · Updated last year
- IO of git-style object databases ☆226 · Updated last month
- Python powered spreadsheets ☆172 · Updated 7 years ago
- Crochet: use Twisted anywhere! ☆240 · Updated last year
- Offering FullText Search of MySQL in SQLAlchemy ☆91 · Updated 4 years ago
- Binary Python bindings for poppler utils for content extraction ☆42 · Updated 4 years ago
- Python CSS-to-inline-styles conversion tool for HTML using BeautifulSoup and cssutils ☆183 · Updated 5 years ago