rajbot / autocrop
This is a side project from 2008. This package contains a tool for automatically cropping and deskewing images of book pages captured by an Internet Archive Scribe bookscanner.
☆28 Updated 12 years ago
Alternatives and similar repositories for autocrop
Users interested in autocrop are comparing it to the libraries listed below.
- A python framework to generate html and JavaScript from reusable and combine-able widgets. ☆23 Updated 2 years ago
- a Simple API for RDF ☆29 Updated 15 years ago
- A slim, non-SWIG Python adapter to CTesseract (Tesseract OCR for C). ☆24 Updated 11 years ago
- Import GeoNames.org data into a SQLite database for full-text search and autocomplete ☆35 Updated 6 years ago
- code to remove "noise" from hOCR output of Tesseract OCR. ☆14 Updated 8 years ago
- A simple PDF transcription project for PyBossa ☆19 Updated 9 years ago
- Python bindings to the WebKit GTK+ port ☆67 Updated 2 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi… ☆48 Updated 3 years ago
- Simple to use python library for Buffer App ☆23 Updated 2 years ago
- A queue-controlled browser automation tool for improving web crawl quality ☆61 Updated 3 months ago
- A django port of MySociety's FixMyStreet, maintained by VisibleGovernment.ca ☆54 Updated 13 years ago
- A tool to upload and synchronize static websites to the Amazon S3 cloud. ☆22 Updated 9 years ago
- your elastic friend to start supervisord processes based on cpu cores available. ☆16 Updated 9 years ago
- a web based tool to monitor how your website content is used in wikipedia ☆37 Updated 4 years ago
- scraper related helper functions ☆27 Updated 11 years ago
- Experiments mining image collections using OpenCV ☆64 Updated 10 years ago
- Wikipedia citation tool for Google Books, New York Times, ISBN, DOI and more ☆22 Updated 8 years ago
- Lightweight, multilingual natural language processing ☆63 Updated 12 years ago
- A skip dict is a Python dictionary which is permanently sorted by value. ☆19 Updated 10 years ago
- Tools for working with Optical Character Recognition output ☆16 Updated 11 years ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy. ☆46 Updated 7 years ago
- ... just because nltk is too heavy ☆35 Updated 14 years ago
- This application demonstrates how to use PostgreSQL as a full-text search engine. ☆63 Updated 7 years ago
- A bridge to the JS CoffeeScript compiler (EOL: Please use coffee command or webpack). ☆82 Updated 7 years ago
- Symbolic Constants in Python ☆23 Updated 9 months ago
- Gymnast: Pythonic PDF Parsing ☆8 Updated 8 years ago
- Vidscraper is a python library which provides a simple API for fetching video data from various web services and sites. ☆62 Updated 2 years ago
- Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Bas… ☆12 Updated 11 years ago
- Akara is an open-source (Apache2 license) Web framework specialized for RESTful data services, especially involving XML and other semi-st… ☆25 Updated 11 years ago
- Markdown -> IPython conversion tool ☆15 Updated 10 years ago