tesseract4java / tesseract4java
Java GUI and Tools for Tesseract OCR
☆328Updated last year
Alternatives and similar repositories for tesseract4java:
Users that are interested in tesseract4java are comparing it to the libraries listed below
- Java JNA wrapper for Tesseract OCR API☆1,665Updated 2 months ago
- Java utility for parsing PDF tabular data using Apache PDFBox and OpenCV☆72Updated last year
- Java OCR allows you to perform OCR and bar code recognition on images (JPEG, PNG, TIFF, PDF, etc.) and output as plain text, xml with ful…☆134Updated 9 years ago
- Box editor and trainer for Tesseract OCR☆239Updated 9 months ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆389Updated 8 months ago
- Pdf2Dom is a PDF parser that converts the documents to a HTML DOM representation. The obtained DOM tree may be then serialized to a HTM…☆182Updated 2 years ago
- Aspose.OCR for Java Examples and Sample Projects☆43Updated last year
- documents4j is a Java library for converting documents into another document format☆574Updated 2 months ago
- Plain Java unrar library☆295Updated 2 months ago
- pdfHTML is an iText add-on for Java that allows you to easily convert HTML and CSS into standards compliant PDFs that are accessible, sea…☆240Updated last week
- Best (most accurate) trained LSTM models.☆1,332Updated last year
- Convert Word documents to simple and clean HTML☆263Updated 3 months ago
- JAI ImageIO Core (without javax.media.jai dependencies)☆240Updated last year
- (Java)A Method to Extract Tabular Content from PDF Files☆332Updated 2 years ago
- Java GUI frontend for Tesseract OCR engine☆64Updated 2 months ago
- JPEG2000 support for Java Advanced Imaging Image I/O Tools API☆77Updated last year
- Fast integer versions of trained LSTM models☆532Updated 8 months ago
- Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)☆188Updated 2 months ago
- An Optical Character Recognition Framework in Java☆31Updated 11 years ago
- Source training data for Tesseract for lots of languages☆854Updated 3 weeks ago
- OCR evaluation brought to you by University of Alicante☆67Updated 2 years ago
- Aspose.PDF for Java examples, plugins and showcases☆131Updated 2 months ago
- Web Browser, Flash Player, HTML editor, Media player for Swing☆196Updated 2 years ago
- Java library for rendering PDF documents to the screen using Java2D☆191Updated last year
- Java JNA Wrapper for Leptonica Image Processing Library☆30Updated 2 months ago
- Powerful, hierachical based desktop search engine based on swing and lucene.☆18Updated 8 years ago
- Example apps for springboot-javafx-support. See☆161Updated 5 years ago
- A Java library to convert .pdf files into .epub, .txt, .png, .jpg, .zip formats.☆211Updated 6 months ago
- Patched JPedal based on the last official JPedal version 4.92☆20Updated 3 years ago
- Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder☆86Updated last month