Tjatse / spider2
A 2nd generation spider to crawl any article site, automatic read title and article.
☆43Updated 9 years ago
Alternatives and similar repositories for spider2:
Users that are interested in spider2 are comparing it to the libraries listed below
- Friendly web crawler for x-ray☆44Updated 2 years ago
- sandcrawler.js - the server-side scraping companion.☆107Updated 9 years ago
- Crawler is a web spider written with Nodejs. It gives you the full power of jQuery on the server to parse a big number of pages as they a…☆41Updated 8 years ago
- a lightweight proxy that lets you to drive phantomjs from node.☆136Updated 10 years ago
- Extract the content of any web page by using various content extractor libraries.☆10Updated 9 years ago
- A node.js module to implement a recommender engine with popular machine-learning algorithms.☆61Updated 9 years ago
- Node.js module to extract and summarize html content☆42Updated 10 years ago
- Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English.☆343Updated 6 years ago
- NLP utilities in javascript and coffeescript☆38Updated 11 years ago
- NodeJS Named Entity Recognition, using Stanford NER (easy install)☆40Updated 7 years ago
- phantom driver for x-ray.☆111Updated 8 years ago
- Cookieless user tracking for node.js☆51Updated 9 years ago
- A web scraper with a web user interface which shows scraping stats in realtime. Uses Node.JS, jQuery, socket.io and Express.☆104Updated 10 years ago
- Demos for the limdu.js package☆18Updated 2 years ago
- Nodejs wrapper for Stanford Classifier.☆47Updated 4 years ago
- Linear regression with Gradient descent package for NPM.☆46Updated 11 years ago
- ExtractContent for node.js☆15Updated 6 years ago
- CelerFT a file uploader for Gigabit sized files over HTTP using a Node.js backend☆27Updated 7 years ago
- Upload large(r) videos to youtube via Google's 'resumable upload' API☆43Updated 6 years ago
- Redis time series statistics with Node.js☆181Updated 8 years ago
- Nodejs text sumarization☆55Updated 11 years ago
- A boilerplate for building a superscript bot☆37Updated 8 years ago
- x-ray's selector parser.☆16Updated 9 years ago
- Node wrapper around FastText Library☆57Updated 2 years ago
- Simhash implementation in Javascript☆38Updated 7 years ago
- A node.js wrapper for Boilerpipe, an excellent Java library for boilerplate removal and fulltext extraction from HTML pages.☆52Updated 7 years ago
- Hosted viewer for documentation.js JSON output.☆34Updated 7 years ago
- Node library to extract keywords from text☆58Updated 9 years ago
- A web crawler/scraper/spider for nodejs☆66Updated 7 years ago
- Rate limiter middleware, backed by Redis☆54Updated 2 years ago