rajbot / autocrop
This is a side project from 2008. This package contains a tool for automatically cropping and deskewing images of book pages captured by an Internet Archive Scribe bookscanner.
☆28 · Updated 11 years ago
Alternatives and similar repositories for autocrop:
Users who are interested in autocrop are comparing it to the repositories listed below:
- Code to remove "noise" from hOCR output of Tesseract OCR. ☆14 · Updated 8 years ago
- Import GeoNames.org data into a SQLite database for full-text search and autocomplete ☆35 · Updated 5 years ago
- A slim, non-SWIG Python adapter to CTesseract (Tesseract OCR for C). ☆24 · Updated 10 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3. ☆15 · Updated 9 years ago
- PIL-compatible interface for platform libraries such as GraphicsMagick, Aware or JAI. ☆25 · Updated 7 years ago
- Document Imaging Archive System. Home document imaging, with OCR. Scan documents (with SANE) or import ODF documents, assign tags. Use op… ☆24 · Updated 9 years ago
- All the reports and data powering http://weekly.hatnote.com ☆12 · Updated this week
- A simple PDF transcription project for PyBossa ☆19 · Updated 9 years ago
- Smart progressbar with multiple backends supporting both explicit updating and tqdm-style iterable-wrapping ☆10 · Updated 8 years ago
- Experiments mining image collections using OpenCV ☆64 · Updated 9 years ago
- Python bindings to the Tesseract API ☆66 · Updated 8 years ago
- A web-based tool to monitor how your website content is used in Wikipedia ☆37 · Updated 4 years ago
- Energy system for social games ☆30 · Updated 10 years ago
- Python's missing statistical Swiss Army knife ☆15 · Updated 9 years ago
- An experimental Python REPL for editing Photos ☆12 · Updated 4 years ago
- Path utilities for Python ☆48 · Updated last year
- A Python framework to generate HTML and JavaScript from reusable and combinable widgets. ☆23 · Updated 2 years ago
- Check out https://github.com/webrecorder/webrecorder for newer version matching https://webrecorder.io ☆38 · Updated 9 years ago
- Attach files to Kinto records ☆42 · Updated last week
- Your elastic friend to start supervisord processes based on available CPU cores. ☆16 · Updated 9 years ago
- A Django port of MySociety's FixMyStreet, maintained by VisibleGovernment.ca ☆54 · Updated 13 years ago
- A clean-room clone of the Fever RSS aggregator, focusing on the API ☆61 · Updated 2 years ago
- Download subscriptions from YouTube ☆11 · Updated 6 years ago
- A queue-controlled browser automation tool for improving web crawl quality ☆60 · Updated 4 years ago
- Webhooks for Django *experimental* ☆63 · Updated 15 years ago
- A skip dict is a Python dictionary which is permanently sorted by value. ☆19 · Updated 10 years ago
- For code related to making ePub files ☆40 · Updated 9 years ago
- In-browser audio editor in the vein of Audacity ☆30 · Updated 9 years ago
- Feedbuffer buffers RSS and Atom syndication feeds, that is to say it caches new feed entries until the news aggregator requests them and … ☆19 · Updated 8 years ago
- Experimental HTML-based terminal emulator using pyte and WebKit. ☆29 · Updated 7 years ago