nguyenq / jTessBoxEditorLinks
Box editor and trainer for Tesseract OCR
☆245Updated last month
Alternatives and similar repositories for jTessBoxEditor
Users that are interested in jTessBoxEditor are comparing it to the libraries listed below
Sorting:
- JavaFX Box editor and trainer for Tesseract OCR☆43Updated last month
- Fast integer versions of trained LSTM models☆557Updated last year
- Source training data for Tesseract for lots of languages☆858Updated 4 months ago
- Best (most accurate) trained LSTM models.☆1,392Updated last year
- Pdf2Dom is a PDF parser that converts the documents to a HTML DOM representation. The obtained DOM tree may be then serialized to a HTM…☆187Updated 2 years ago
- Codes And Documents For OcrKing Api☆228Updated last year
- Train Tesseract LSTM with make☆687Updated 3 months ago
- Line based ATR Engine based on OCRopy☆1,156Updated 2 months ago
- 一个简易的ofd解析库,支持OFD转图片☆66Updated 4 years ago
- Data used for LSTM model training☆119Updated last year
- Java JNA Wrapper for Leptonica Image Processing Library☆30Updated 2 weeks ago
- Leptonica is an open source library containing software that is broadly useful for image processing and image analysis applications. The …☆1,941Updated 2 weeks ago
- Java GUI frontend for Tesseract OCR engine☆67Updated last month
- Utility to convert PDF into JPG files☆57Updated 2 years ago
- Java OCR 识别组件(基于Tesseract OCR 引擎)。能自动完成图片清理、识别 CAPTCHA 验证码图片内容的一体化工作。Java Image cleanup, OCR recognition component (based Tesseract OCR e…☆621Updated 4 years ago
- finetuned traineddata files for tesseract 4.0.0 for testing☆168Updated 6 years ago
- A scientific document recognition system☆169Updated 2 years ago
- A command line tool to convert Microsoft Office documents to PDFs☆643Updated last year
- 身份证识别OCR☆487Updated 2 years ago
- Tesseract 4 OCR Compilation - Docker Container☆54Updated 3 years ago
- docker-PaddleOCR☆23Updated last year
- Remove content from your digital documents irretrievably instead of just covering it up. Redact text, images, parts of images or drawings…☆40Updated last month
- Various documents related to Tesseract OCR☆266Updated 3 years ago
- Aspose.PDF for Java examples, plugins and showcases☆135Updated 5 months ago
- (Java)A Method to Extract Tabular Content from PDF Files☆335Updated 2 years ago
- A C# interface for TWAIN☆176Updated 2 years ago
- charlesw/tesseract 4.0 build for x64 Windows using C++ run-time 141.☆62Updated 6 years ago
- A standalone Java library/command line tool that converts DOC, DOCX, PPT, PPTX and ODT documents to PDF files.☆602Updated 2 years ago
- Converts XHTML to OpenXML WordML (docx) using docx4j☆146Updated 3 weeks ago
- pdfOCR is an iText 7 add-on to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-compli…☆36Updated this week