Tjatse / spider2
A 2nd generation spider to crawl any article site, automatic read title and article.
☆43Updated 9 years ago
Alternatives and similar repositories for spider2:
Users that are interested in spider2 are comparing it to the libraries listed below
- Friendly web crawler for x-ray☆44Updated 2 years ago
- Crawler is a web spider written with Nodejs. It gives you the full power of jQuery on the server to parse a big number of pages as they a…☆41Updated 8 years ago
- Extract the content of any web page by using various content extractor libraries.☆10Updated 9 years ago
- A node.js wrapper for Boilerpipe, an excellent Java library for boilerplate removal and fulltext extraction from HTML pages.☆52Updated 7 years ago
- phantom driver for x-ray.☆111Updated 8 years ago
- sandcrawler.js - the server-side scraping companion.☆107Updated 9 years ago
- Image optimizer, PNG, JPEG and GIFimage compress on OS X, Linux, FreeBSD and Windows☆67Updated 5 years ago
- Redis-based task queue library inspired by Celery and Kue.☆56Updated 10 years ago
- Simhash implementation in Javascript☆38Updated 7 years ago
- Node wrapper around PDF.JS library to read and render PDFs☆52Updated 8 years ago
- REST interface for Redis-Simple-Message-Queue☆43Updated 8 years ago
- node.js wrapper for the Diffbot API (article and frontpage)☆35Updated 9 years ago
- High-availability redis in Node.js.☆154Updated 6 years ago
- An express inspired, event-driven framework for building real time distributed applications over socket.io and redis.☆127Updated 9 years ago
- A node.js module to implement a recommender engine with popular machine-learning algorithms.☆61Updated 9 years ago
- Demos for the limdu.js package☆18Updated 2 years ago
- combine node / browser apps into a single script.☆37Updated 8 years ago
- A realtime clone of the Hacker News homepage backed by HN's Firebase API☆51Updated 10 years ago
- Cookieless user tracking for node.js☆51Updated 10 years ago
- x-ray's selector parser.☆16Updated 9 years ago
- a lightweight proxy that lets you to drive phantomjs from node.☆136Updated 10 years ago
- general server render base on headless chrome☆95Updated 7 years ago
- Redis Message Connector☆14Updated 6 years ago
- Watches for changes in MongoDB replication log.☆95Updated 10 years ago
- Qool, a leveldb backed Queue☆42Updated 8 years ago
- Adds `markdown` property to Nodemailer e-mail data☆37Updated 5 years ago
- Interactive terminal list for nodejs☆125Updated 11 years ago
- [DEPRECATED] Matilda.js v0.0.2 -- Webscale Inference Toolkit☆24Updated 7 years ago
- Article content extraction database☆40Updated 2 years ago
- A simple desktop notification for electron apps☆55Updated 4 years ago