nguyenq / jTessBoxEditorLinks
Box editor and trainer for Tesseract OCR
☆247Updated 3 months ago
Alternatives and similar repositories for jTessBoxEditor
Users that are interested in jTessBoxEditor are comparing it to the libraries listed below
Sorting:
- Fast integer versions of trained LSTM models☆567Updated last year
- JavaFX Box editor and trainer for Tesseract OCR☆44Updated 3 months ago
- Source training data for Tesseract for lots of languages☆858Updated 6 months ago
- Best (most accurate) trained LSTM models.☆1,426Updated last year
- Train Tesseract LSTM with make☆698Updated 5 months ago
- Java GUI frontend for Tesseract OCR engine☆69Updated 3 months ago
- Line based ATR Engine based on OCRopy☆1,165Updated 4 months ago
- Various documents related to Tesseract OCR☆265Updated 4 years ago
- Data used for LSTM model training☆122Updated last year
- 一个简易的ofd解析库,支持OFD转图片☆66Updated 4 years ago
- Pdf2Dom is a PDF parser that converts the documents to a HTML DOM representation. The obtained DOM tree may be then serialized to a HTM…☆190Updated 2 years ago
- TWAIN Data Source Manager☆156Updated 2 years ago
- finetuned traineddata files for tesseract 4.0.0 for testing☆169Updated 6 years ago
- Java OCR 识别组件(基于Tesseract OCR 引擎)。能自动完成图片清理、识别 CAPTCHA 验证码图片内容的一体化工作。Java Image cleanup, OCR recognition component (based Tesseract OCR e…☆625Updated 4 years ago
- A standalone Java library/command line tool that converts DOC, DOCX, PPT, PPTX and ODT documents to PDF files.☆609Updated 2 years ago
- Remove content from your digital documents irretrievably instead of just covering it up. Redact text, images, parts of images or drawings…☆42Updated 2 weeks ago
- Files and Scripts to run Tesseract 5 LSTM Training using fonts☆79Updated 3 years ago
- Java JNA Wrapper for Leptonica Image Processing Library☆30Updated 2 weeks ago
- Java OCR allows you to perform OCR and bar code recognition on images (JPEG, PNG, TIFF, PDF, etc.) and output as plain text, xml with ful…☆136Updated 10 years ago
- (Java)A Method to Extract Tabular Content from PDF Files☆335Updated 2 years ago
- this tool is used for edit word/excel/ppt in web.☆279Updated 10 years ago
- A command line tool to convert Microsoft Office documents to PDFs☆648Updated last year
- Tesseract 4 OCR Compilation - Docker Container☆55Updated 3 years ago
- ☆146Updated 5 years ago
- An implementation of CRNN (CNN+LSTM+warpCTC) on MxNet for chinese text recognition☆219Updated 2 years ago
- pdfOCR is an iText add-on to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-complian…☆37Updated 2 weeks ago
- A Android client tool based on the OCR recognition engine that identifies the text of the table and exports the results in the form of an…☆62Updated last year
- pdfHTML is an iText add-on for Java that allows you to easily convert HTML and CSS into standards compliant PDFs that are accessible, sea…☆249Updated last week
- docker-PaddleOCR☆25Updated last year
- Converts XHTML to OpenXML WordML (docx) using docx4j☆144Updated 2 weeks ago