Tjatse / spider2
A 2nd generation spider to crawl any article site, automatic read title and article.
☆43Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for spider2
- Friendly web crawler for x-ray☆44Updated last year
- Crawler is a web spider written with Nodejs. It gives you the full power of jQuery on the server to parse a big number of pages as they a…☆41Updated 7 years ago
- sandcrawler.js - the server-side scraping companion.☆107Updated 8 years ago
- Simhash implementation in Javascript☆38Updated 7 years ago
- A web crawler/scraper/spider for nodejs☆67Updated 7 years ago
- Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English.☆343Updated 6 years ago
- A node.js module to implement a recommender engine with popular machine-learning algorithms.☆61Updated 9 years ago
- Linear regression with Gradient descent package for NPM.☆45Updated 11 years ago
- Linear Regression library in pure Javascript☆42Updated 11 years ago
- Cookieless user tracking for node.js☆52Updated 9 years ago
- x-ray's selector parser.☆16Updated 8 years ago
- High-availability redis in Node.js.☆154Updated 6 years ago
- Extract meta-data from a html string. It extracts the body, title, meta-tags and first headlines to a object to push them to a search ind…☆14Updated 8 years ago
- NLP utilities in javascript and coffeescript☆37Updated 10 years ago
- Take screenshots☆40Updated last year
- A simple desktop notification for electron apps☆55Updated 4 years ago
- Image optimizer, PNG, JPEG and GIFimage compress on OS X, Linux, FreeBSD and Windows☆67Updated 4 years ago
- Node.js module to extract and summarize html content☆42Updated 10 years ago
- [DEPRECATED] Matilda.js v0.0.2 -- Webscale Inference Toolkit☆24Updated 6 years ago
- Node.js Client Library for seaweedfs (weed-fs)☆50Updated 2 years ago
- a lightweight proxy that lets you to drive phantomjs from node.☆137Updated 10 years ago
- A node.js wrapper for Boilerpipe, an excellent Java library for boilerplate removal and fulltext extraction from HTML pages.☆52Updated 7 years ago
- Nodejs wrapper for Stanford Classifier.☆47Updated 3 years ago
- Redis Message Connector☆14Updated 5 years ago
- Tokenize paragraphs into sentences, and smaller tokens.☆48Updated last year
- general server render base on headless chrome☆95Updated 6 years ago
- Redis-based task queue library inspired by Celery and Kue.☆56Updated 10 years ago
- THIS REPO HAS BEEN MOVED TO https://github.com/sockethub/sockethub - a simple tool to facilitate handling and referencing activity stream…☆11Updated 4 years ago
- Criteria queries on JSON objects Mongo style☆33Updated 6 years ago