CenterForOpenScience / pydocx
An extendable docx file format parser and converter
☆191Updated 4 years ago
Alternatives and similar repositories for pydocx:
Users that are interested in pydocx are comparing it to the libraries listed below
- Command-line tool for exploring and diagnosing problems with Microsoft Office Open XML files (.docx, .pptx, .xlsx)☆51Updated 6 months ago
- Python wrapper for Pandoc—the universal document converter.☆215Updated 9 years ago
- A threadsafe sqlite worker for Python☆99Updated 4 years ago
- A library for extracting tables from PDF files☆89Updated 4 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆149Updated 3 months ago
- Transport adapter for fetching file:// URLs with the requests python library☆86Updated 9 months ago
- Regular Expression based parsers for extracting data from natural languages☆70Updated 7 years ago
- PyQuery-based scraping micro-framework.☆116Updated 3 years ago
- Dump (freeze) SQL query results from a database into a selection of file formats☆92Updated 5 years ago
- Fork of ReportLab http://www.reportlab.com/ftp/reportlab-2.5.tar.gz☆38Updated last year
- Convert Word documents (.docx files) to HTML☆923Updated 3 months ago
- A simple python wrapper for PDFium.☆17Updated 3 years ago
- Python module to drive the awesome pdftk binary.☆148Updated 2 years ago
- Flask extenstion which provides Spyne support☆45Updated 4 years ago
- A module for querying the DOM tree and writing XPath expressions using native Python syntax.☆127Updated 6 years ago
- PyOO allows you to control a running OpenOffice or LibreOffice program for reading and writing spreadsheet documents☆103Updated 5 years ago
- Conservatively convert html to markdown☆98Updated 4 years ago
- CSS Selectors for Python☆293Updated 3 weeks ago
- Flask webassets integration.☆457Updated last year
- Python package for Google's diff-match-patch native C++ implementation.☆75Updated 10 months ago
- Pythonic Git for Humans☆732Updated 7 years ago
- Convert html to docx☆77Updated 9 months ago
- A Python module that tries to figure out what your local timezone is☆201Updated last month
- Generate PDF files out of your Flask website thanks to WeasyPrint☆147Updated 4 months ago
- Find the path of a key / value in a JSON hierarchy easily.☆95Updated 2 years ago
- A utility to read and write PDFs with Python☆335Updated 3 years ago
- Python powered spreadsheets☆173Updated 6 years ago
- Python binding to libpoppler-qt5☆42Updated last year
- Mirror of https://bitbucket.org/rptlab/reportlab☆64Updated 2 years ago
- A Python toolkit for processing tabular data☆417Updated last month