midorikocak / docsplitter
A small tool to split .docx files by headings.
☆12Updated 4 years ago
Alternatives and similar repositories for docsplitter:
Users that are interested in docsplitter are comparing it to the libraries listed below
- A simple python wrapper for PDFium.☆16Updated 3 years ago
- Docscan is a document scanner. Take a photo of your documents and frame it.☆98Updated 3 months ago
- Fast PDF generation and compression. Deals with millions of pages daily.☆107Updated 6 months ago
- Pretrained mixed models to be used with Calamari.☆60Updated 4 months ago
- Efficient hOCR tooling☆42Updated this week
- Master repository which includes most other OCR-D repositories as submodules☆72Updated last week
- Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder☆86Updated last week
- User contributed (non Google) OCR models for Tesseract☆24Updated 4 months ago
- Poppler based fast pdf viewer written in PyQt5☆10Updated 5 months ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated last month
- Light-weight image viewer with crop,resize,collage, photogrid and filters☆17Updated 7 months ago
- The hOCR Embedded OCR Workflow and Output Format☆74Updated 6 months ago
- Make PDF Files Accessible, Extract Data from PDF, Convert PDF to HTML, Fill-in PDF Form, Stamp PDF and more...☆20Updated 2 weeks ago
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆22Updated 7 years ago
- Tesseract Powered Windows Desktop OCR Application With Multiple Pre/Post Processing GUI☆41Updated 10 months ago
- Pre-Recognition Library - library with algorithms for improving OCR quality.☆34Updated 3 years ago
- Building scantailor and its dependencies☆58Updated last year
- Scripts to auto-OCR PDFs, translate output using publicly-available or DIY NLP translation models, and generate epub/PDF☆42Updated 9 months ago
- Python book cover generator☆29Updated 4 years ago
- Tools for professional translators running GNU/Linux☆29Updated 3 years ago
- guides and test data for OCR4all☆30Updated 2 years ago
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project☆46Updated 3 months ago
- An EPUB editor☆21Updated 6 years ago
- Scan Tailor Experimental is an interactive post-processing tool for scanned pages.☆52Updated 2 weeks ago
- Gamera 3 for Python 2 (deprecated)☆39Updated 2 years ago
- ☆14Updated last month
- Prepress preparing tool and PDF editor☆17Updated last year
- PAGE XML format collection for document image page content and more☆67Updated 3 years ago
- OCR-D-compliant page segmentation☆67Updated last week
- a flexible unit converter☆40Updated last year