xissy / node-boilerpipe
A node.js wrapper for Boilerpipe, an excellent Java library for boilerplate removal and fulltext extraction from HTML pages.
☆52 Updated 8 years ago
Alternatives and similar repositories for node-boilerpipe
Users who are interested in node-boilerpipe are comparing it to the libraries listed below.
- A simple-but-useful kNN library for NodeJS, comparing JSON Objects using Euclidean distances ☆213 Updated 10 years ago
- phantom driver for x-ray. ☆111 Updated 9 years ago
- Node library to extract keywords from text ☆57 Updated 10 years ago
- Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English. ☆346 Updated 7 years ago
- Friendly web crawler for x-ray ☆44 Updated 2 years ago
- A simple node.js wrapper for stanford-core-nlp. ☆148 Updated 8 years ago
- A 2nd generation spider to crawl any article site, automatic read title and article. ☆43 Updated 9 years ago
- sandcrawler.js - the server-side scraping companion. ☆109 Updated 9 years ago
- Node.js module to extract and summarize html content ☆42 Updated 11 years ago
- Martin Porter's stemmer for node.js ☆100 Updated 5 years ago
- High-availability redis in Node.js. ☆154 Updated 7 years ago
- Streaming uploads to Amazon Web Service(AWS) S3 for NodeJS ☆78 Updated 3 years ago
- Headless is a Node.js wrapper for Xvfb, the virtual framebuffer ☆94 Updated 9 years ago
- Apache Tika bridge for Node.js. Text and metadata extraction, language detection and more. ☆141 Updated last year
- WordNet Database files (previously WNdb) ☆218 Updated 5 years ago
- ExtractContent for node.js ☆15 Updated 6 years ago
- Automatically extracts structured information from webpages ☆109 Updated 3 years ago
- Crawler is a web spider written with Nodejs. It gives you the full power of jQuery on the server to parse a big number of pages as they a… ☆41 Updated 9 years ago
- Web scraping and HTML-reprocessing. The easy way. ☆390 Updated 6 years ago
- Simple and easy to use full text search engine. ☆81 Updated 7 years ago
- remote monitoring and debugging for socket.io ☆451 Updated 10 years ago
- Node.js module for the Bing Search API (Cognitive Services) ☆57 Updated 4 years ago
- A javascript library for collaborative filtering and recommendation engines designed for node.js ☆201 Updated 7 years ago
- Apache OpenNLP wrapper for Nodejs ☆56 Updated 6 years ago
- Native javascript implementation of the standard Sphinx API ☆76 Updated 6 years ago
- A simple node.js wrapper for Stanford CoreNLP. ☆77 Updated 3 years ago
- A helper robot written in node javascript ☆74 Updated 13 years ago
- Scrapes a remote page and creates a summary with statistics ☆38 Updated 11 years ago
- A new kind of headless webkit integration for nodejs; a great alternative to phantomjs. ☆848 Updated 5 years ago
- NodeJS Named Entity Recognition, using Stanford NER (easy install) ☆40 Updated 8 years ago