midorikocak / docsplitterLinks
A small tool to split .docx files by headings.
☆13Updated 5 years ago
Alternatives and similar repositories for docsplitter
Users that are interested in docsplitter are comparing it to the libraries listed below
Sorting:
- Interactive visualization of Wiktionary words and etymologies.☆98Updated 3 weeks ago
- In-browser OCR of Ancient Greek and Latin☆26Updated last month
- A simple word processor built with web technologies.☆71Updated 3 years ago
- LF Aligner helps translators create translation memories from texts and their translations. It relies on Hunalign for automatic sentence …☆14Updated 10 years ago
- CAT (Computer Aided Translation) tool based on Open Standards☆100Updated 2 weeks ago
- Fast PDF generation and compression. Deals with millions of pages daily.☆134Updated last month
- Pre-Recognition Library - library with algorithms for improving OCR quality.☆36Updated 4 years ago
- Building scantailor and its dependencies☆65Updated 2 years ago
- ScanTailor Universal - a fork based on Enhanced+Featured+Master versions of ST☆239Updated last month
- Search engine for structured data☆24Updated 2 weeks ago
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR …☆67Updated 2 years ago
- QtSemanticNotes is a personal knowledge base, personal wiki or just note taking application that features automatic linking, tree view an…☆19Updated 8 years ago
- TMX Editor written in Java and TypeScript☆49Updated 2 months ago
- OCR for DjVu☆47Updated 3 years ago
- smoothscan is a tool to convert scanned text into a vectorized output form.☆68Updated 12 years ago
- Convert a PDF via OCR to a TXT file in UTF-8 encoding☆156Updated 2 years ago
- The hOCR Embedded OCR Workflow and Output Format☆75Updated last year
- Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder☆91Updated 4 months ago
- Public repository for Coptic SCRIPTORIUM Corpora Releases☆40Updated last month
- A post-processing tool for scanned sheets of paper.☆85Updated last year
- Super-project that aggregates all Pipeline related code, provides a common tracker for Pipeline related issues and holds the Pipeline web…☆24Updated this week
- PAGE XML format collection for document image page content and more☆70Updated 3 weeks ago
- Polytonic Greek OCR tool suite based on Ocropus 0.7☆13Updated 2 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆194Updated 2 months ago
- The CIS OCR PostCorrectionTool☆44Updated 3 years ago
- Linux-intelligent-ocr-solution☆149Updated 8 months ago
- Document Image Binarization☆79Updated last year
- Minstrel is a FLOSS hybrid reading app specifically designed for Audio-eBooks☆98Updated 9 years ago
- Scripts to auto-OCR PDFs, translate output using publicly-available or DIY NLP translation models, and generate epub/PDF☆44Updated last year
- User contributed (non Google) OCR models for Tesseract☆30Updated 9 months ago