hollingsworthd / ScreenSlicer
Automatic, zero-config web scraping -- written in Java, has no dependency on Java EE or app servers, and the web scraper has a restful/JSON API. Currently unmaintained.
☆155Updated 7 years ago
Alternatives and similar repositories for ScreenSlicer
Users that are interested in ScreenSlicer are comparing it to the libraries listed below
Sorting:
- Examples for my book "Power Java"☆21Updated 2 years ago
- A stream of deduplicated tweets built using RxJava and Twitter4J☆10Updated 9 years ago
- Slinky, a high-performance web crawler / text analytics in Python, Redis, Hadoop, R, Gephi☆41Updated 14 years ago
- ☆20Updated 8 years ago
- Create python web applications for Google Glass☆280Updated 11 years ago
- OpenBlock is a web application and RESTful service that allows users to browse and search their local area for "hyper-local news☆61Updated 3 years ago
- ☆49Updated 8 years ago
- WARC (Web Archive) Input and Output Formats for Hadoop☆35Updated 10 years ago
- Python script to help you decide what movie to watch.☆34Updated 9 years ago
- A collection of efficient utilities for a data scientist.☆41Updated 10 years ago
- A script to get summary of text content☆31Updated 8 years ago
- A crawler to collect reviews and product information on Amazon.com☆75Updated 8 years ago
- The first Open Source document analysis platform☆65Updated 3 years ago
- A component based data flow framework with a drag-n-drop Web 2.0 interface. Based on Stackless Python and inspired by Yahoo! Pipes.☆150Updated 12 years ago
- Tagger for questions posted on StackExchange Network☆37Updated 7 years ago
- cron-like jobs for back-end systems☆76Updated 6 years ago
- An alternative take on Java object relational mapping☆51Updated 8 months ago
- A simple proxy web service in 19 lines of Python code.☆23Updated 10 years ago
- Open Source Social Media Monitoring And Engagement System Core/API☆36Updated 10 years ago
- Ready-to-run examples to accompany the "Programming Google App Engine" books, by Dan Sanderson☆16Updated 10 years ago
- A Java library that can do URL normalization, unshorten URL, and URL extraction.☆19Updated 7 years ago
- Blog crawler for the blogforever project.☆22Updated 11 years ago
- 'People who downloaded this paper also downloaded...'☆51Updated 12 years ago
- A web application for tracking your portfolio☆24Updated 2 years ago
- How to spot first stories on Twitter using Storm.☆125Updated last year
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 4 years ago
- [DEPRECATED] A Java client for the FullContact API☆28Updated 4 years ago
- XTractor is an algorithmic text extractor from web pages written in Java. It builds upon the "commonly used web design practices" approac…☆43Updated 9 years ago
- DEPRECATED: Go to https://github.com/prodicus/spammy for DEV version☆14Updated 9 years ago
- convenient web rss-reader☆51Updated last year