tesseract-ocr / tesseract-ocr.github.ioLinks
Tesseract documentation
☆75Updated 4 years ago
Alternatives and similar repositories for tesseract-ocr.github.io
Users that are interested in tesseract-ocr.github.io are comparing it to the libraries listed below
Sorting:
- Source training data for Tesseract for lots of languages☆863Updated 10 months ago
- Website crawler for Adblock Plus☆26Updated 9 years ago
- Fast integer versions of trained LSTM models☆594Updated last year
- Java GUI and Tools for Tesseract OCR☆335Updated 2 years ago
- Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder☆91Updated 4 months ago
- Want to learn more about Free Law Project technologies, policies and thinking? Get the literature here.☆25Updated 4 years ago
- Repository for tesseract testing☆35Updated last year
- Part of eMOP: Franken+ tool for creating font training for Tesseract OCR engine from page images.☆24Updated 10 years ago
- A small framework taking over the manual training process described in the Tesseract3 Wiki: https://code.google.com/p/tesseract-ocr/wiki/…☆131Updated 2 years ago
- ABBYY Cloud OCR SDK☆528Updated 2 years ago
- User contributed (non Google) OCR models for Tesseract☆30Updated 9 months ago
- NARA File Analyzer and Metadata Harvester☆112Updated 9 years ago
- Alternative UI for Apereo uPortal (originally built for MyUW)☆25Updated 3 years ago
- Best (most accurate) trained LSTM models.☆1,501Updated last year
- OCR evaluation brought to you by University of Alicante☆67Updated 3 years ago
- Tool for visualizing hOCR output from Tesseract (or other OCR engines that support hOCR).☆26Updated 11 years ago
- ☆39Updated 10 years ago
- Structured Data from PDF image-based files☆90Updated 12 years ago
- scraper related helper functions☆27Updated 11 years ago
- Mapping photos of Old New York☆293Updated last year
- Common tools and content for MongoDB documentation projects.☆44Updated 2 years ago
- Box editor and trainer for Tesseract OCR☆251Updated last month
- Next generation OCR engine based on LSTMs.☆52Updated 7 years ago
- a quick and dirty script to convert a Word (docx) document to html.☆54Updated last month
- Logya is a static site generator written in Python designed to be easy to use and flexible.☆18Updated 2 months ago
- Terminology management web platform☆50Updated 3 years ago
- A more complete example of programming with PDFMiner, which continues where the default documentation stops☆216Updated 6 years ago
- Eclipse Node.js IDE - p2f and knowledge base☆52Updated 11 years ago
- ALTO XML schema - latest and all former versions☆55Updated 3 weeks ago
- Wrapper for pdftohtml that tries to extract paragraph structure☆52Updated 7 years ago