mawanda-jun / IntelligentOCRLinks
An intelligent OCR to detect tables and pure text inside PDFs and obtaing a csv file and a txt from it
☆14Updated 6 years ago
Alternatives and similar repositories for IntelligentOCR
Users that are interested in IntelligentOCR are comparing it to the libraries listed below
Sorting:
- Play the card game Baccarat☆14Updated 11 months ago
- This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an…☆23Updated 4 years ago
- Neural Elastic Inference and Search☆19Updated 5 years ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆70Updated this week
- Document Image Classification☆11Updated 7 years ago
- A tool designed to extract numerical data from scanned historical weather documents.☆13Updated 6 months ago
- A simple viewer and inspection tool for text boxes in PDF documents☆95Updated 3 years ago
- Given a text, wrap it into phrases and send them to Yandex's search engine. If it yields a "did you mean:", substitute the original phras…☆11Updated 6 years ago
- Next generation OCR engine based on LSTMs.☆52Updated 7 years ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated last month
- An efficient data structure for fast string similarity searches☆22Updated 4 years ago
- Remove duplicate documents/videos/images via popular algorithms such as SimHash, SpotSig, Shingling, etc.☆18Updated last year
- Extract structured data from PDF invoices☆14Updated 4 years ago
- Easy formatted text extraction from images using Google Vision API☆42Updated 4 years ago
- A web application to process receipt images by Deep learning based OCR☆13Updated 4 years ago
- Automatic Table reader. Can extract table data from images.☆15Updated 6 years ago
- Convert a corpus of PDF to clean text files on a distributed architecture☆39Updated last year
- Detecting Trends in Job Advertisements☆20Updated 6 years ago
- Using pre-trained YOLO algorithm to detect faces in photo ID documents for ID verification☆10Updated 7 years ago
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…☆20Updated 2 years ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format☆46Updated 2 months ago
- Machine Learning-assisted correction of OCR errors in historical corpora☆9Updated 8 months ago
- A system for reading scanned documents and grouping them into high level topics☆16Updated 4 years ago
- 版面分析+OCR☆11Updated 3 years ago
- Translate many large PDF Reports for free using Python.☆33Updated 2 years ago
- Web App Capable of Predicting Next Word Using BERT☆14Updated 2 years ago
- Implementation of BertGrid : https://arxiv.org/abs/1909.04948☆30Updated last year
- Image Pre-processing to improve OCR accuracy.☆20Updated 8 years ago
- Extract knowledge from raw text☆13Updated 3 years ago
- ☆11Updated last year