tesseract4java / tesseract4javaLinks
Java GUI and Tools for Tesseract OCR
☆328Updated last year
Alternatives and similar repositories for tesseract4java
Users that are interested in tesseract4java are comparing it to the libraries listed below
Sorting:
- Java OCR allows you to perform OCR and bar code recognition on images (JPEG, PNG, TIFF, PDF, etc.) and output as plain text, xml with ful…☆135Updated 9 years ago
- Pdf2Dom is a PDF parser that converts the documents to a HTML DOM representation. The obtained DOM tree may be then serialized to a HTM…☆184Updated 2 years ago
- Java utility for parsing PDF tabular data using Apache PDFBox and OpenCV☆72Updated 2 years ago
- JAI ImageIO Core (without javax.media.jai dependencies)☆243Updated last year
- documents4j is a Java library for converting documents into another document format☆577Updated 4 months ago
- pdfOCR is an iText 7 add-on to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-compli…☆36Updated 3 weeks ago
- Java JNA Wrapper for Leptonica Image Processing Library☆30Updated this week
- Java JNA wrapper for Tesseract OCR API☆1,672Updated 3 months ago
- (Java)A Method to Extract Tabular Content from PDF Files☆334Updated 2 years ago
- Web Browser, Flash Player, HTML editor, Media player for Swing☆198Updated 2 years ago
- Java library for reading, writing, converting and manipulating images and metadata☆207Updated 9 months ago
- Open-source barcode encoding program written in Java☆360Updated 3 weeks ago
- Plain Java unrar library☆297Updated last week
- Java2word is a Library to generate MS Word Documents from Java code without any special components.☆95Updated 3 years ago
- pdfHTML is an iText add-on for Java that allows you to easily convert HTML and CSS into standards compliant PDFs that are accessible, sea…☆241Updated this week
- Java GUI frontend for Tesseract OCR engine☆64Updated 3 months ago
- PDF Rendering and Viewing API in Java☆96Updated last week
- 📘 A Citation Style Language (CSL) processor for Java.☆93Updated last week
- Aspose.PDF for Java examples, plugins and showcases☆133Updated 3 months ago
- Converts XHTML to OpenXML WordML (docx) using docx4j☆143Updated last month
- java decaptcha☆142Updated 4 years ago
- Apache XML Graphics Batik☆229Updated this week
- Lobo is an extensible all-Java web browser and RIA platform. It supports HTML 5, Javascript (AJAX) and CSS 3 plus direct JavaFX and Java …☆98Updated last year
- OCR evaluation brought to you by University of Alicante☆67Updated 2 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆392Updated 9 months ago
- Originally exported from code.google.com/p/juniversalchardet☆357Updated 3 weeks ago
- Aspose.OCR for Java Examples and Sample Projects☆43Updated last year
- Test area for public PDFBox v2 issues on stackoverflow etc☆85Updated 2 months ago
- Java library for rendering PDF documents to the screen using Java2D☆191Updated 2 years ago
- Various documents related to Tesseract OCR☆265Updated 3 years ago