Calamari-OCR / calamari
Line-based ATR engine based on OCRopy
☆1,126 Updated 2 weeks ago
Alternatives and similar repositories for calamari:
Users who are interested in calamari compare it to the libraries listed below:
- OCR engine for all the languages ☆796 Updated this week
- Train Tesseract LSTM with make ☆660 Updated 9 months ago
- Python-based tools for document analysis and OCR ☆3,444 Updated 3 years ago
- Text detection with mainly MSER and SWT ☆200 Updated 4 months ago
- Visual Attention based OCR ☆1,117 Updated 6 years ago
- A small C++ implementation of LSTM networks, focused on OCR. ☆825 Updated 5 years ago
- A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model. ☆1,443 Updated 7 months ago
- TableBank: A Benchmark Dataset for Table Detection and Recognition ☆1,041 Updated 7 months ago
- Various documents related to Tesseract OCR ☆265 Updated 3 years ago
- Fast integer versions of trained LSTM models ☆522 Updated 7 months ago
- A curated list of promising OCR resources ☆1,676 Updated 2 years ago
- Generic framework for historical document processing ☆375 Updated 3 years ago
- Repository collecting all the submodules for the new PyTorch-based OCR System. ☆142 Updated 4 years ago
- Total Text Dataset. It consists of 1555 images with more than 3 different text orientations: Horizontal, Multi-Oriented, and Curved, one … ☆754 Updated last year
- darknet text detect and darknet cnn ocr ☆1,149 Updated 3 years ago
- Generate text images for training deep learning ocr model ☆1,424 Updated 3 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books. ☆185 Updated 3 months ago
- A simple document layout analysis using Python-OpenCV ☆124 Updated 4 years ago
- A Tensorflow model for text recognition (CNN + seq2seq with visual attention) available as a Python package and compatible with Google Cl… ☆1,081 Updated last year
- Document Image Dewarping ☆362 Updated 5 years ago
- ☆939 Updated 6 months ago
- A pure pytorch implemented ocr project including text detection and recognition ☆595 Updated 3 years ago
- DocBank: A Benchmark Dataset for Document Layout Analysis ☆602 Updated 7 months ago
- Source training data for Tesseract for lots of languages ☆850 Updated last year
- Recognizing cropped text in natural images. ☆730 Updated 2 years ago
- Tensorflow, Luminoth Based Table Detection and Extraction ☆163 Updated 2 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML. ☆389 Updated 7 months ago
- papers about ocr ☆401 Updated 2 years ago
- ☆430 Updated 3 years ago
- Document Layout Analysis resources repos for development with PdfPig. ☆605 Updated last year