xissy / node-boilerpipe
A node.js wrapper for Boilerpipe, an excellent Java library for boilerplate removal and fulltext extraction from HTML pages.
☆52 Updated 7 years ago
Alternatives and similar repositories for node-boilerpipe:
Users who are interested in node-boilerpipe are comparing it to the libraries listed below.
- A simple node.js wrapper for stanford-core-nlp. ☆149 Updated 7 years ago
- Friendly web crawler for x-ray ☆44 Updated 2 years ago
- Rate limiter middleware, backed by Redis ☆54 Updated 2 years ago
- A PredictionIO 0.9+ client ☆60 Updated 6 years ago
- Streaming uploads to Amazon Web Services (AWS) S3 for Node.js ☆78 Updated 3 years ago
- fetch & parse ATOM & RSS feeds with Node.js ☆74 Updated 6 years ago
- Tokenize paragraphs into sentences, and smaller tokens. ☆48 Updated last year
- Redis adapter for SocketCluster ☆45 Updated 5 years ago
- phantom driver for x-ray. ☆111 Updated 8 years ago
- A simple node.js wrapper for Stanford CoreNLP. ☆77 Updated 3 years ago
- ExtractContent for node.js ☆15 Updated 6 years ago
- NodeJS Named Entity Recognition, using Stanford NER (easy install) ☆40 Updated 7 years ago
- remote monitoring and debugging for socket.io ☆452 Updated 10 years ago
- An implementation of a google browserchannel server in node.js ☆288 Updated 4 years ago
- javascript implementation of the popular snowball word stemming nlp algorithm ☆102 Updated 14 years ago
- tools for working with Princeton's lexical database WordNet ☆73 Updated 6 years ago
- A simple-but-useful kNN library for NodeJS, comparing JSON Objects using Euclidean distances ☆214 Updated 9 years ago
- Parser for robots.txt for node.js ☆67 Updated 4 years ago
- Scrape/crawl articles from any site automatically. Make any web page readable, whether Chinese or English. ☆344 Updated 6 years ago
- s3-streaming-upload is a Node.js library that listens to your stream and uploads its data to Amazon S3 using the ManagedUpload API. ☆123 Updated 2 years ago
- This stemming module for Node.js provides stemming capability for a variety of languages using Dr. M.F. Porter's Snowball API. ☆51 Updated last month
- Lomath is a tensorial math library extended from lodash, with performant math functions applicable to tensors (multi-arrays). It also has … ☆19 Updated 9 years ago
- Node library to extract keywords from text ☆58 Updated 9 years ago
- ☆27 Updated 6 years ago
- A 2nd generation spider to crawl any article site, automatically reading the title and article. ☆43 Updated 9 years ago
- Martin Porter's stemmer for node.js ☆100 Updated 4 years ago
- A web crawler/scraper/spider for nodejs ☆66 Updated 7 years ago
- Apache OpenNLP wrapper for Nodejs ☆56 Updated 6 years ago
- Headless is a Node.js wrapper for Xvfb, the virtual framebuffer ☆93 Updated 8 years ago
- Node.js module to extract and summarize html content ☆42 Updated 10 years ago