rajbot / autocrop
This is a side project from 2008. This package contains a tool for automatically cropping and deskewing images of book pages captured by an Internet Archive Scribe bookscanner.
☆28Updated 12 years ago
Alternatives and similar repositories for autocrop
Users who are interested in autocrop are comparing it to the libraries listed below.
- This is a module to extract RDF from an HTML5 page annotated with microdata. The module implements the algorithm defined and published by th…☆44Updated 2 years ago
- Django feeds provides an extensive database model for RSS feeds and a fault tolerant parser.☆31Updated 12 years ago
- code to remove "noise" from hOCR output of Tesseract OCR.☆14Updated 8 years ago
- Import GeoNames.org data into a SQLite database for full-text search and autocomplete☆35Updated 6 years ago
- PIL-compatible interface for platform libraries such as GraphicsMagick, Aware or JAI.☆25Updated 7 years ago
- Wikipedia citation tool for Google Books, New York Times, ISBN, DOI and more☆22Updated 8 years ago
- a Simple API for RDF☆29Updated 15 years ago
- A queue-controlled browser automation tool for improving web crawl quality☆61Updated 2 months ago
- Check out https://github.com/webrecorder/webrecorder for newer version matching https://webrecorder.io☆38Updated 9 years ago
- A slim, non-SWIG Python adapter to CTesseract (Tesseract OCR for C).☆24Updated 11 years ago
- A simple PDF transcription project for PyBossa☆19Updated 9 years ago
- Plots various graphs for a series of plaintext files in a directory☆19Updated 9 years ago
- A MediaWiki-to-HTML parser for Python.☆53Updated 5 years ago
- A python abstraction for SKOS vocabularies☆18Updated 6 months ago
- Python's missing statistical Swiss Army knife☆15Updated 9 years ago
- a web based tool to monitor how your website content is used in wikipedia☆37Updated 4 years ago
- ... just because nltk is too heavy☆35Updated 14 years ago
- Discover, analyze and present data from the web and mobile in meaningful ways☆82Updated 11 years ago
- Markdown -> IPython conversion tool☆15Updated 10 years ago
- A clean-room clone of the Fever RSS aggregator, focusing on the API☆61Updated 3 years ago
- Experiments mining image collections using OpenCV☆64Updated 10 years ago
- Pyline is a grep-like, sed-like, awk-like command-line tool for line-based text processing in Python. https://pypi.python.org/pypi/pyline☆38Updated 10 months ago
- Webhooks for Django *experimental*☆62Updated 15 years ago
- Cross-platform file locking in Python☆57Updated 10 years ago
- Python library for creating word clouds from text☆51Updated 6 years ago
- Python bindings to the Tesseract API☆66Updated 8 years ago
- Python and pandas tools to perform various analyses on different types of word lists☆16Updated 10 years ago
- Google Refine extension for adding columns (extending data) from DBpedia☆39Updated 11 years ago
- All the reports and data powering http://weekly.hatnote.com☆13Updated this week
- Jabba's headless webkit browser for scraping AJAX-powered webpages.☆91Updated 10 years ago