Tjatse / spider2
A 2nd-generation spider that crawls any article site and automatically reads the title and article.
☆43 Updated 9 years ago
Alternatives and similar repositories for spider2
Users who are interested in spider2 are comparing it to the libraries listed below.
- Friendly web crawler for x-ray ☆44 Updated 2 years ago
- Crawler is a web spider written with Nodejs. It gives you the full power of jQuery on the server to parse a big number of pages as they a… ☆41 Updated 8 years ago
- Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English. ☆344 Updated 7 years ago
- tools for working with Princeton's lexical database WordNet ☆73 Updated 7 years ago
- phantom driver for x-ray. ☆111 Updated 9 years ago
- A web scraper with a web user interface which shows scraping stats in realtime. Uses Node.JS, jQuery, socket.io and Express. ☆105 Updated 10 years ago
- Demos for the limdu.js package ☆18 Updated 3 years ago
- Simhash implementation in Javascript ☆38 Updated 8 years ago
- Node.js module to extract and summarize html content ☆42 Updated 10 years ago
- sandcrawler.js - the server-side scraping companion. ☆107 Updated 9 years ago
- High-availability redis in Node.js. ☆154 Updated 7 years ago
- A simple node.js wrapper for stanford-core-nlp. ☆149 Updated 8 years ago
- A helper robot written in node javascript ☆74 Updated 13 years ago
- A node.js module to implement a recommender engine with popular machine-learning algorithms. ☆61 Updated 9 years ago
- A simple node.js wrapper for Stanford CoreNLP. ☆77 Updated 3 years ago
- a lightweight proxy that lets you drive phantomjs from node. ☆136 Updated 10 years ago
- Nodejs wrapper for Stanford Classifier. ☆47 Updated 4 years ago
- NLP utilities in javascript and coffeescript ☆37 Updated 11 years ago
- Language sentiment analysis and neural networks... for trolls. ☆333 Updated 12 years ago
- THIS REPO HAS BEEN MOVED TO https://github.com/sockethub/sockethub - a simple tool to facilitate handling and referencing activity stream… ☆12 Updated 5 years ago
- Extract a list of keywords from a website, sorted by word count. ☆51 Updated 8 years ago
- Apache Tika bridge for Node.js. Text and metadata extraction, language detection and more. ☆142 Updated last year
- Nodejs text summarization ☆54 Updated 11 years ago
- Node library to extract keywords from text ☆58 Updated 10 years ago
- Node wrapper around PDF.JS library to read and render PDFs ☆52 Updated 8 years ago
- NodeJS Named Entity Recognition, using Stanford NER (easy install) ☆40 Updated 8 years ago
- Extract text from pdfs that contain searchable pdf text ☆116 Updated 6 years ago
- Simple, lightweight and expressive web scraping with Node.js ☆154 Updated 3 years ago
- A node.js wrapper for Boilerpipe, an excellent Java library for boilerplate removal and fulltext extraction from HTML pages. ☆52 Updated 8 years ago
- Code Friends is a collaborative programming environment with real-time concurrent editing, and text/video chat. Collaborate with others r… ☆61 Updated 9 years ago