CenterForOpenScience / pydocx
An extendable docx file format parser and converter
☆191 Updated 4 years ago
Alternatives and similar repositories for pydocx:
Users who are interested in pydocx are comparing it to the libraries listed below.
- Command-line tool for exploring and diagnosing problems with Microsoft Office Open XML files (.docx, .pptx, .xlsx) ☆48 Updated 5 months ago
- Python powered spreadsheets ☆173 Updated 6 years ago
- Python module to drive the awesome pdftk binary. ☆148 Updated last year
- Create, read, and modify Excel .xlsx files ☆107 Updated 4 years ago
- Python wrapper for Pandoc—the universal document converter. ☆215 Updated 9 years ago
- Convert Word documents (.docx files) to HTML ☆915 Updated 2 months ago
- XPath 1.0/2.0/3.0/3.1 parsers and selectors for ElementTree and lxml ☆77 Updated 2 weeks ago
- Conservatively convert html to markdown ☆98 Updated 4 years ago
- Python package for Google's diff-match-patch native C++ implementation. ☆74 Updated 9 months ago
- Flask extension which provides Spyne support ☆45 Updated 4 years ago
- A utility to read and write pdfs with Python. Superseded: see https://github.com/knowah/PyPDF2 ☆89 Updated 8 years ago
- Python wrapper of the HTML Tidy library for fixing invalid HTML ☆48 Updated 4 years ago
- Python binding to libpoppler-qt5 ☆42 Updated last year
- The simplest way to extract text from PDFs in Python ☆427 Updated 2 years ago
- Authorization tools for Flask ☆107 Updated 3 years ago
- CSS Selectors for Python ☆293 Updated this week
- Python bindings to the Tesseract API ☆66 Updated 8 years ago
- Regular Expression based parsers for extracting data from natural languages ☆70 Updated 7 years ago
- MongoDB Python logging handler, Centralized logging made simple using MongoDB. ☆135 Updated 5 years ago
- A more complete example of programming with PDFMiner, which continues where the default documentation stops ☆214 Updated 5 years ago
- Gzip flask responses ☆95 Updated 5 years ago
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text. ☆52 Updated 2 months ago
- SQLAlchemy->Datatables ☆54 Updated 9 months ago
- Python library of web-related functions ☆400 Updated last month
- PyOO allows you to control a running OpenOffice or LibreOffice program for reading and writing spreadsheet documents ☆102 Updated 5 years ago
- Additional fields, validators and widgets for WTForms. ☆68 Updated 2 weeks ago
- Collection of persistent (disk-based) and non-persistent (memory-based) queues for Python ☆275 Updated last week
- 📚 Ordered Multivalue Dictionary. Powers furl. ☆68 Updated 3 years ago
- Python flexible slugify function ☆489 Updated 4 years ago
- Bringing sanity to world of messed-up data ☆66 Updated 10 years ago