Extract docx headers, footers, (formatted) text, footnotes, endnotes, properties, and images.
☆208 · Jun 8, 2026 · Updated this week
Alternatives and similar repositories for docx2python
Users that are interested in docx2python are comparing it to the libraries listed below.
- A pure python based utility to extract text and images from docx files. ☆586 · Mar 24, 2025 · Updated last year
- A simple library for segmenting legal texts ☆18 · Apr 22, 2023 · Updated 3 years ago
- python module to manipulate text, strings and list of strings ☆21 · May 10, 2022 · Updated 4 years ago
- Simplify DOCX files to JSON ☆262 · Sep 26, 2024 · Updated last year
- The Python docx package cannot read paragraphs, tables and images in document order. It can only render all the paragraphs at once or all… ☆84 · Mar 11, 2024 · Updated 2 years ago
- Convert Word documents (.docx files) to HTML ☆1,100 · May 24, 2026 · Updated 2 weeks ago
- Use built-in macOS optical character recognition (OCR) via the command line ☆18 · Nov 17, 2025 · Updated 6 months ago
- Split a TiddlyWiki into multiple text files, one file per tiddler. ☆11 · Apr 18, 2019 · Updated 7 years ago
- A small Finder clone for System 6. Just as a fun retro programming project. ☆11 · Jul 19, 2022 · Updated 3 years ago
- Mirror of the Moby Project containing public-domain lexical resources; word lists, thesaurus, hyphenation, pronunciation. ☆16 · Jul 27, 2014 · Updated 11 years ago
- ☆38 · Mar 10, 2016 · Updated 10 years ago
- The Data Exploration lesson in the Reproducible Science using Jupyter Notebooks curriculum ☆14 · Aug 27, 2023 · Updated 2 years ago
- Create and modify Word documents with Python ☆5,622 · Jun 17, 2025 · Updated 11 months ago
- Show the differences between two strings/text as a compact text, in markdown/HTML, in the terminal and more. ☆156 · Mar 28, 2026 · Updated 2 months ago
- Typed, annotated vectors for well-documented datasets ☆11 · Apr 15, 2026 · Updated last month
- Convert JSON Schemas to simple, human-readable Markdown documentation. Repo archived in favor of fork: sbrunner/jsonschema2md2 ☆27 · Jul 12, 2023 · Updated 2 years ago
- Use a docx as a jinja2 template ☆2,657 · May 18, 2026 · Updated 3 weeks ago
- Docx tracked change redlines for the Python ecosystem. ☆110 · May 31, 2026 · Updated last week
- API client for fetching and comparing passages from legislation ☆14 · Jan 26, 2025 · Updated last year
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus ☆13 · Jan 2, 2021 · Updated 5 years ago
- Torch implementation of the Collobert's SENNA system for NER. ☆13 · Jun 27, 2016 · Updated 9 years ago
- An open source framework for Retrieval-Augmented System (RAG) uses semantic search helps to retrieve the expected results and generate h… ☆22 · Nov 21, 2025 · Updated 6 months ago
- Open-source, knowledge-grounded conversational assistant ☆14 · Jun 30, 2025 · Updated 11 months ago
- Large lexicon for APE (~100,000 entries) ☆33 · Nov 5, 2018 · Updated 7 years ago
- Download client for legal opinions ☆13 · Jun 2, 2026 · Updated last week
- A Betty Blocks Component Set based on Material UI ☆25 · May 21, 2026 · Updated 3 weeks ago
- Python toolkit for SSSOM mapping format ☆62 · Jun 1, 2026 · Updated last week
- My collection of Python scripts simplifying debugging under lldb ☆11 · May 27, 2024 · Updated 2 years ago
- OniGuruma Regular Expression Framework for Cocoa ☆35 · Jul 28, 2022 · Updated 3 years ago
- Convert any JSON File to Normalised tables ☆16 · Oct 21, 2021 · Updated 4 years ago
- An extendable docx file format parser and converter ☆194 · May 19, 2025 · Updated last year
- Themed, fully featured PDF viewer for the Atom editor ☆12 · Jan 28, 2026 · Updated 4 months ago
- ☆15 · Jun 16, 2021 · Updated 4 years ago
- ⚙ Skeleton TiddlyWiki for developing plugins using the ThirdFlow, see: https://thediveo.github.io/TiddlyWikiPluginSkeleton and https://yo… ☆10 · Mar 1, 2020 · Updated 6 years ago
- Managing the progress for the RDA Working Group on Fair Mappings (https://www.rd-alliance.org/groups/fair-mappings-wg/). ☆11 · May 19, 2026 · Updated 3 weeks ago
- ☆19 · Nov 1, 2023 · Updated 2 years ago
- A Python module to provide software abstractions to ease accessing hyperknowledge graphs ☆11 · Dec 19, 2024 · Updated last year
- Client library for OpenOCR ☆32 · Dec 3, 2014 · Updated 11 years ago
- mermaid extension to add support for mermaid graph inside markdown file ☆37 · Dec 28, 2023 · Updated 2 years ago