rajbot / autocropLinks
This is a side project from 2008. This package contains a tool for automatically cropping and deskewing images of book pages captured by an Internet Archive Scribe bookscanner.
☆28Updated 12 years ago
Alternatives and similar repositories for autocrop
Users that are interested in autocrop are comparing it to the libraries listed below
Sorting:
- Vidscraper is a python library which provides a simple API for fetching video data from various web services and sites.☆62Updated 3 years ago
- Serapis is a sentence identifier and modeling pipeline / built for Wordnik☆24Updated 9 years ago
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED]☆24Updated 8 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Updated 3 years ago
- your elastic friend to start supervisord processes based on cpu cores available.☆16Updated 9 years ago
- Neddick: Open Source Information Discovery Platform☆36Updated 2 years ago
- A python framework to generate html and JavaScript from reusable and combine-able widgets.☆23Updated 2 years ago
- DEPRECATED - Code for source.mozillaopennews.org/☆36Updated 6 years ago
- Simple to use python library for Buffer App☆23Updated 2 years ago
- Django feeds provides an extensive database model for RSS feeds and a fault tolerant parser.☆31Updated 13 years ago
- A queue-controlled browser automation tool for improving web crawl quality☆61Updated 4 months ago
- A sample app that combines geolocated entities from Freebase with Maps API☆42Updated 11 years ago
- A MediaWiki-to-HTML parser for Python.☆53Updated 5 years ago
- A multitouch python framework☆97Updated 3 years ago
- Python library implementing the ISO/IEC 26300 OpenDocument Format standard (ODF)☆54Updated 5 years ago
- A slim, non-SWIG Python adapter to CTesseract (Tesseract OCR for C).☆24Updated 11 years ago
- This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet…☆29Updated 7 months ago
- A Python library for generating fake user data.☆142Updated 8 years ago
- The more often you click a word in the headlines, the more interesting are your news.☆13Updated 8 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Bas…☆12Updated 11 years ago
- HTML5 Customizable Reader & Admin Console - Librelio Digital Publishing Suite☆29Updated 9 years ago
- Code for several utilities for use with VIVO☆11Updated 12 years ago
- Pyline is a grep-like, sed-like, awk-like command-line tool for line-based text processing in Python. https://pypi.python.org/pypi/pyline☆38Updated last month
- Some convenient natural language tools that build on NLTK.☆85Updated 11 years ago
- Wikipedia citation tool for Google Books, New York Times, ISBN, DOI and more☆22Updated 8 years ago
- stacked authentication policies for pyramid☆41Updated 10 months ago
- Django framework for crowdsourcing complex tasks using MTurk☆64Updated 14 years ago
- Full text extraction using the Open Source Tesseract OCR software https://code.google.com/p/tesseract-ocr/ and imagemagick☆12Updated 10 years ago
- A DSL to build Lucene text queries in Python.☆38Updated 8 years ago