REMitchell / tesseract-trainer
Tools used to generate training files for the Tesseract OCR project
☆90 Updated 2 years ago
Alternatives and similar repositories for tesseract-trainer
Users who are interested in tesseract-trainer are comparing it to the libraries listed below.
- Scrapy Middleware to set a random User-Agent for every Request. ☆202 Updated 6 years ago
- Scrapy extension to control spiders using JSON-RPC ☆300 Updated 6 years ago
- Scrapy spider middleware to ignore requests to pages containing items seen in previous crawls ☆277 Updated 11 months ago
- Generator of User-Agent header ☆348 Updated 4 months ago
- Python wrapper for the tesseract OCR engine. The module is based on OpenCV ☆178 Updated 8 years ago
- Solana Arbitrage Bot on pump.fun, Meteora, Raydium and Orca using Jito bundling, RPC and gRPC. Solana Arbitrage Bot Solana Arbitrage Bot … ☆495 Updated 2 months ago
- Command line client for Scrapyd server ☆778 Updated last month
- Random User-Agent middleware based on fake-useragent ☆690 Updated 2 years ago
- Python 3 port of pdfminer ☆187 Updated 7 years ago
- This is the sample code for the Introduction to Tornado book, published by O'Reilly Media. ☆569 Updated last year
- Useful test spiders for Scrapy ☆184 Updated 6 years ago
- An elementary captcha decoder written in python ☆155 Updated 10 years ago
- A simple wrapper for zbar ☆166 Updated last week
- Python library of web-related functions ☆414 Updated last week
- Fill HTML login forms automatically ☆276 Updated last year
- This is a sample Scrapy project for educational purposes ☆1,352 Updated 2 years ago
- Collection of Scrapy utilities (extensions, middlewares, pipelines, etc) ☆33 Updated 7 years ago
- A small framework taking over the manual training process described in the Tesseract3 Wiki: https://code.google.com/p/tesseract-ocr/wiki/… ☆131 Updated 2 years ago
- A python wrapper for Browsermob Proxy ☆244 Updated last year
- Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy ☆368 Updated 10 months ago
- A client interface for Scrapinghub's API ☆204 Updated 4 months ago
- Extends Selenium WebDriver classes to include the request function from the Requests library, while doing all the needed cookie and reque… ☆496 Updated last year
- Scrapy project based on dirbot to show how to use Twisted's adbapi to store the scraped data in MySQL. ☆118 Updated 12 years ago
- Simple library to send email using GMail (includes background worker and logging classes) [Python 2/3 support] ☆110 Updated 8 years ago
- Utilities for working with Excel files that require both xlrd and xlwt. ☆272 Updated 6 years ago
- Use pyppeteer from a Scrapy spider ☆59 Updated 6 years ago
- Scrapy Training companion code ☆173 Updated 7 years ago
- Whoosh indexing capabilities for Flask-SQLAlchemy ☆283 Updated 2 years ago
- MongoDB pipeline for Scrapy. This module supports both MongoDB in standalone setups and replica sets. scrapy-mongodb will insert the item… ☆358 Updated 4 years ago
- Kafka-based components for Scrapy ☆78 Updated 7 years ago