tesseract4java / tesseract4javaLinks
Java GUI and Tools for Tesseract OCR
☆335Updated 2 years ago
Alternatives and similar repositories for tesseract4java
Users that are interested in tesseract4java are comparing it to the libraries listed below
Sorting:
- Pdf2Dom is a PDF parser that converts the documents to a HTML DOM representation. The obtained DOM tree may be then serialized to a HTM…☆191Updated last month
- Java OCR allows you to perform OCR and bar code recognition on images (JPEG, PNG, TIFF, PDF, etc.) and output as plain text, xml with ful…☆137Updated 10 years ago
- Java utility for parsing PDF tabular data using Apache PDFBox and OpenCV☆80Updated 2 years ago
- Box editor and trainer for Tesseract OCR☆250Updated last month
- pdfHTML is an iText add-on for Java that allows you to easily convert HTML and CSS into standards compliant PDFs that are accessible, sea…☆254Updated last week
- Convert Word documents to simple and clean HTML☆284Updated last month
- Java JNA Wrapper for Leptonica Image Processing Library☆30Updated 3 weeks ago
- Java library for rendering PDF documents to the screen using Java2D☆191Updated 2 years ago
- JAI ImageIO Core (without javax.media.jai dependencies)☆250Updated 2 years ago
- documents4j is a Java library for converting documents into another document format☆587Updated this week
- This will demonstrate extracting text from scanned documents ( pdf, jpg, tiff, bmp, png etc)☆30Updated 9 years ago
- A Java library to convert .pdf files into .epub, .txt, .png, .jpg, .zip formats.☆217Updated last year
- pdfOCR is an iText add-on to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-complian…☆41Updated last week
- The Open Source RTF (Rich Text Format) Java Library☆47Updated last week
- An HTML to PDF conversion library written in Java, based on wkhtmltopdf.☆185Updated 7 years ago
- Aspose.PDF for Java examples, plugins and showcases☆140Updated last month
- Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or fi…☆196Updated 2 weeks ago
- JODConverter automates document conversions using LibreOffice/OpenOffice.org☆466Updated 3 years ago
- (Java)A Method to Extract Tabular Content from PDF Files☆336Updated 2 years ago
- JPEG2000 support for Java Advanced Imaging Image I/O Tools API☆81Updated 2 years ago
- Converts XHTML to OpenXML WordML (docx) using docx4j☆147Updated 3 months ago
- Aspose.OCR for Java Examples and Sample Projects☆43Updated last year
- SWT Win32 Extension extends the Eclipse library SWT.☆36Updated 9 years ago
- Web Browser, Flash Player, HTML editor, Media player for Swing☆199Updated 2 years ago
- Java JNA wrapper for Tesseract OCR API☆1,731Updated this week
- XDocReport Samples☆56Updated 8 years ago
- Fast integer versions of trained LSTM models☆592Updated last year
- An Eclipse Plugin to integrate different Class Decompiler seamlessly into the development workflow☆275Updated last month
- The image4j library allows you to read and write certain image formats in 100% pure Java.☆82Updated 2 years ago
- VocabHunter helps learners of foreign languages find vital new vocabulary to study.☆290Updated last year