etfre / oodocxLinks
Python library for creating, reading, and modifying Docx files for Microsoft Word
☆16Updated 11 years ago
Alternatives and similar repositories for oodocx
Users that are interested in oodocx are comparing it to the libraries listed below
Sorting:
- Convert a corpus of PDF to clean text files on a distributed architecture☆38Updated last year
- Batch convert PDF files to text under Windows, using several text extraction methods or OCR☆35Updated 9 years ago
- Extract tables from PDF pages.☆296Updated 5 years ago
- A simple viewer and inspection tool for text boxes in PDF documents☆95Updated 3 years ago
- copy of pdftohtml code with enhancements☆25Updated last year
- Super-project that aggregates all Pipeline related code, provides a common tracker for Pipeline related issues and holds the Pipeline web…☆22Updated 2 weeks ago
- Live Transcription for Augmented Reality Glasses☆12Updated 2 weeks ago
- Archive.org OPDS Bookserver - A standard for digital book distribution☆130Updated 6 years ago
- Terminology management web platform☆49Updated 3 years ago
- small collection of python scripts for pdf manipulation☆95Updated last year
- KISS genealogy tree visualization using d3.js + birthday calendar☆27Updated 2 years ago
- A PDF classifier ensemble with REST API service☆23Updated 4 years ago
- fasttext with wheels and no external dependency, but only the predict method (<1MB)☆17Updated 9 months ago
- A PDFMiner wrapper to ease the text extraction from pdf files.☆25Updated 12 years ago
- Draft Markdeep diagrams with a live-updating preview overlaid onto your source code.☆32Updated 6 months ago
- Python module for interacting with OCLC's WorldCat APIs, including the WorldCat Search API, the WorldCat Registry API and the xID APIs (x…☆61Updated 13 years ago
- Structured Data from PDF image-based files☆89Updated 12 years ago
- Simple Python GUI Tool for Tesseract4☆15Updated 5 years ago
- An expandable and scalable OCR pipeline☆87Updated 7 years ago
- Small library containing various image processing algorithms (+ Python 3 bindings) that has almost no dependencies -- Moved to Gnome's Gi…☆62Updated 7 years ago
- uEngine5 BPMS that totally re-written in Microservices architecture. uEngine5 can act as not only a conventional Workflow or BPMS but als…☆10Updated 3 years ago
- A command-line tool for interacting with books in git☆111Updated last year
- Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)☆36Updated last year
- Transforms literary/philosophical texts into patent applications☆360Updated 11 years ago
- Scraper for downloading the entire ebooks repository of project Gutenberg☆152Updated last month
- Fast PDF generation and compression. Deals with millions of pages daily.☆122Updated last week
- Predicts likes, comment or total interactions of a facebook page post using machine learning☆10Updated 7 years ago
- compare two PDF files, write a resulting PDF with highlighted changes☆57Updated last year
- A declarative, parameter-parsing library that provides multiple parsing interfaces (YAML, command line, and JSON)☆13Updated 4 years ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.☆47Updated 7 years ago