midorikocak / docsplitter
A small tool to split .docx files by headings.
☆12Updated 4 years ago
Alternatives and similar repositories for docsplitter:
Users that are interested in docsplitter are comparing it to the libraries listed below
- LF Aligner helps translators create translation memories from texts and their translations. It relies on Hunalign for automatic sentence …☆12Updated 9 years ago
- Fast PDF generation and compression. Deals with millions of pages daily.☆116Updated 8 months ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated last week
- « Make your own typeface from your handwriting! ». ⚠ Work in progress. My fork adds multi-page scanned template, french/spanish accents, …☆25Updated 6 years ago
- Create CHM documentation from simple hierarchy of plain text files.☆25Updated 11 years ago
- Python-based research framework for developing, organizing, and deploying Deep Learning models powered by Tensorflow.☆12Updated 2 years ago
- A selection of test lines of several early printed books as well as the corresponding individual OCRopus models and mixed models.☆10Updated 7 years ago
- Theano Classical Fonts☆13Updated 3 years ago
- Interactive visualization of Wiktionary words and etymologies.☆92Updated 2 months ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆13Updated last year
- Building scantailor and its dependencies☆58Updated last year
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR …☆65Updated last year
- Gamera 3 for Python 2 (deprecated)☆39Updated 2 years ago
- Super-project that aggregates all Pipeline related code, provides a common tracker for Pipeline related issues and holds the Pipeline web…☆21Updated last week
- Tools for professional translators running GNU/Linux☆31Updated 3 years ago
- Monomachus font for Old Cyrillic☆8Updated 8 years ago
- Master repository which includes most other OCR-D repositories as submodules☆73Updated 3 weeks ago
- The CIS OCR PostCorrectionTool☆42Updated 2 years ago
- A simple eBook stylesheet for dyslexia☆13Updated 8 years ago
- A web application to create and edit EPUBs, written in CakePHP.☆17Updated 10 years ago
- Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder☆86Updated last month
- Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.☆17Updated last year
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project☆49Updated 6 months ago
- Text to PDF converter with Unicode support☆74Updated last year
- XSLT stylesheets to convert TEI to HTML and ePub format.☆41Updated this week
- 'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.☆22Updated 7 years ago
- Automatic de-keystoning for single camera DIY book scanners☆22Updated 9 years ago
- Search engine for digital publication based on EPUB 3☆51Updated 7 years ago
- Search engine for structured data☆24Updated 2 months ago
- Automatic de-keystoning for single camera DIY book scanners.☆48Updated 4 years ago