hollingsworthd / ScreenSlicerLinks
Automatic, zero-config web scraping -- written in Java, has no dependency on Java EE or app servers, and the web scraper has a restful/JSON API. Currently unmaintained.
☆155Updated 7 years ago
Alternatives and similar repositories for ScreenSlicer
Users that are interested in ScreenSlicer are comparing it to the libraries listed below
Sorting:
- OpenBlock is a web application and RESTful service that allows users to browse and search their local area for "hyper-local news☆61Updated 3 years ago
- A simple proxy web service in 19 lines of Python code.☆23Updated 10 years ago
- ☆20Updated 8 years ago
- Blog crawler for the blogforever project.☆22Updated 11 years ago
- How to spot first stories on Twitter using Storm.☆125Updated last year
- ☆49Updated 8 years ago
- 'People who downloaded this paper also downloaded...'☆51Updated 12 years ago
- Mirror of Apache MRUnit☆38Updated 6 years ago
- Create python web applications for Google Glass☆280Updated 11 years ago
- ☆45Updated 8 years ago
- Bixo is an open source web mining toolkit that runs as a series of Cascading pipes on top of Hadoop. By building a customized Cascading p…☆142Updated 2 years ago
- Collects multimedia content shared through social networks.☆19Updated 10 years ago
- Real-Time, Twitter sentiment analyzer engine☆144Updated 11 years ago
- Algorithmic summarizer for RSS/Atom Feeds, Web Urls and arbitrary text. Codebase for the application deployed at http://tldrzr.herokuapp.…☆53Updated 8 years ago
- convenient web rss-reader☆51Updated last year
- ☆13Updated 9 years ago
- [DEPRECATED] A Java client for the FullContact API☆28Updated 4 years ago
- A collection of efficient utilities for a data scientist.☆41Updated 10 years ago
- Quickly analyze and explore email with advanced analytics and visualization.☆56Updated 3 years ago
- Vizlinc☆15Updated 9 years ago
- ☆21Updated 10 years ago
- WARC (Web Archive) Input and Output Formats for Hadoop☆35Updated 10 years ago
- The first Open Source document analysis platform☆65Updated 3 years ago
- Faceted search engine for domain-specific exploration of the Web☆45Updated 8 years ago
- YCB Java☆27Updated last year
- Mavenized version of Kelvin Tan's example (http://www.lucenetutorial.com/lucene-in-5-minutes.html)☆70Updated 5 months ago
- Full text extraction using the Open Source Tesseract OCR software https://code.google.com/p/tesseract-ocr/ and imagemagick☆12Updated 10 years ago
- ScraperWiki Python library for scraping and saving data☆159Updated 2 years ago
- API server for TextBlob: Sentiment analysis, POS tagging, noun phrase extraction.☆24Updated 10 years ago
- faceted search engine☆41Updated 10 years ago