Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images.
☆208 · Jun 29, 2026 · Updated this week
Alternatives and similar repositories for docx2python
Users that are interested in docx2python are comparing it to the libraries listed below.
- A pure python based utility to extract text and images from docx files. ☆586 · Mar 24, 2025 · Updated last year
- A simple library for segmenting legal texts ☆18 · Apr 22, 2023 · Updated 3 years ago
- Simplify DOCX files to JSON ☆263 · Sep 26, 2024 · Updated last year
- The Python docx package cannot read paragraphs, tables and images in document order. It can only render all the paragraphs at once or all… ☆84 · Mar 11, 2024 · Updated 2 years ago
- Convert Word documents (.docx files) to HTML ☆1,107 · May 24, 2026 · Updated last month
- Split a TiddlyWiki into multiple text files, one file per tiddler. ☆11 · Apr 18, 2019 · Updated 7 years ago
- A small Finder clone for System 6. Just as a fun retro programming project. ☆11 · Jul 19, 2022 · Updated 3 years ago
- Building or integrating an LLM wrapper shouldn't take more than 10 minutes. ☆13 · Feb 1, 2025 · Updated last year
- Mirror of the Moby Project containing public-domain lexical resources; word lists, thesaurus, hyphenation, pronunciation. ☆16 · Jul 27, 2014 · Updated 11 years ago
- ☆38 · Mar 10, 2016 · Updated 10 years ago
- A collection of python routines to help identify and morph objects. ☆13 · Oct 2, 2021 · Updated 4 years ago
- Write beautifully short contracts. https://reference.legal/ is a referenceable clause library to standardize contracts once and for all. ☆13 · Jul 12, 2022 · Updated 3 years ago
- ☆15 · Apr 25, 2015 · Updated 11 years ago
- Show the differences between two strings/text as a compact text, in markdown/HTML, in the terminal and more. ☆158 · Mar 28, 2026 · Updated 3 months ago
- Convert JSON Schemas to simple, human-readable Markdown documentation. Repo archived in favor of fork: sbrunner/jsonschema2md2 ☆27 · Jul 12, 2023 · Updated 2 years ago
- Use a docx as a jinja2 template ☆2,668 · May 18, 2026 · Updated last month
- Docx tracked change redlines for the Python ecosystem. ☆114 · May 31, 2026 · Updated last month
- API client for fetching and comparing passages from legislation ☆14 · Jun 13, 2026 · Updated 2 weeks ago
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus ☆14 · Jan 2, 2021 · Updated 5 years ago
- Open-source, knowledge-grounded conversational assistant ☆14 · Jun 30, 2025 · Updated last year
- Download client for legal opinions ☆13 · Jun 12, 2026 · Updated 2 weeks ago
- A Betty Blocks Component Set based on Material UI ☆25 · May 21, 2026 · Updated last month
- Python toolkit for SSSOM mapping format ☆62 · Jun 21, 2026 · Updated last week
- OniGuruma Regular Expression Framework for Cocoa ☆35 · Jul 28, 2022 · Updated 3 years ago
- Convert any JSON File to Normalised tables ☆16 · Oct 21, 2021 · Updated 4 years ago
- ☆17 · Apr 30, 2026 · Updated 2 months ago
- An extendable docx file format parser and converter ☆194 · Jun 25, 2026 · Updated last week
- Themed, fully featured PDF viewer for the Atom editor ☆12 · Jan 28, 2026 · Updated 5 months ago
- ☆15 · Jun 16, 2021 · Updated 5 years ago
- ☆19 · Nov 1, 2023 · Updated 2 years ago
- Client library for OpenOCR ☆32 · Dec 3, 2014 · Updated 11 years ago
- Demo App ☆11 · Jan 27, 2026 · Updated 5 months ago
- ☆13 · Jun 17, 2026 · Updated 2 weeks ago
- Utilities and applications for the FlatGov project by Demand Progress ☆17 · Feb 8, 2023 · Updated 3 years ago
- Symas (OpenLDAP) LMDB back-end for RDF::Repository ☆17 · Updated this week
- A graph based dependency parser in PyTorch. ☆28 · Oct 6, 2018 · Updated 7 years ago
- Leo code snippets ☆13 · Apr 27, 2020 · Updated 6 years ago
- scraping and querying documents for LLMs ☆24 · Oct 6, 2025 · Updated 8 months ago
- This repository contains materials for the Open Legal Data Forum at the Legal Hacker 2019 (September 2019 + Brooklyn, NYC) ☆17 · Dec 8, 2022 · Updated 3 years ago