ybur-yug / python_ocr_tutorial
This is a tutorial on getting OCR running on a simple web server, using python, flask, tesseract-ocr, and leptonica
☆256Updated 3 years ago
Related projects: ⓘ
- ☆46Updated this week
- A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab☆930Updated 6 years ago
- Python script to do PDF OCR conversion using Tesseract☆372Updated last year
- "Scrape Easy" - an extension of the Scrapy framework.☆188Updated 8 years ago
- An implementation of RESTful web service for tesseract-OCR using tornado☆134Updated last year
- A simple python OCR engine using opencv☆522Updated 7 months ago
- ☆125Updated 9 years ago
- Mapping photos of Old New York☆288Updated this week
- A more complete example of programming with PDFMiner, which continues where the default documentation stops☆215Updated 4 years ago
- A Google Charts API for Python, meant to be used as an alternative to matplotlib.☆205Updated 7 years ago
- The simplest way to extract text from PDFs in Python☆426Updated 2 years ago
- Extract tables from PDF pages.☆274Updated 4 years ago
- Python wrapper for the tesseract OCR engine. The module is based on OpenCV☆178Updated 7 years ago
- A small framework taking over the manual training process described in the Tesseract3 Wiki: https://code.google.com/p/tesseract-ocr/wiki/…☆130Updated last year
- A Python toolkit for processing tabular data☆414Updated last month
- Easier wrangling of web data.☆259Updated 6 years ago
- An OpenCV based document scanner☆794Updated 8 years ago
- Tool to extract news articles from newspaper and give the context about the news☆211Updated 7 years ago
- A python script for summarizing articles using nltk☆540Updated 8 years ago
- Detect text blocks and OCR poorly scanned PDFs in bulk. Python module available via pip.☆1,272Updated 3 years ago
- Process image to capture text and then use tesseract to computer OCR☆75Updated 9 years ago
- Python web scraping framework☆315Updated 6 years ago
- PyWebhooks - A Webhooks Service☆87Updated 5 years ago
- Various documents related to Tesseract OCR☆260Updated 3 years ago
- A library for extracting tables from PDF files☆90Updated 10 years ago
- Minimalist Requests wrapper to work within rate limits of any amount of services simultaneously. Parallel processing friendly.☆417Updated 6 years ago
- A simple program to extract the text from an image before performing OCR☆221Updated 4 years ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆363Updated last month
- ☆33Updated this week
- ☆167Updated 5 years ago