ageitgey / node-unfluff
Automatically extract body content (and other cool stuff) from an html document
☆2,150Updated last year
Related projects ⓘ
Alternatives and complementary repositories for node-unfluff
- Node module that summarizes text using a naive summarization algorithm☆769Updated 3 weeks ago
- Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English.☆343Updated 6 years ago
- 📚 Turn any web page into a clean view☆2,486Updated 3 years ago
- A complete and versatile web scraper.☆3,709Updated 4 years ago
- A search server that can be installed with npm☆655Updated last month
- natural language processor powered by plugins part of the @unifiedjs collective☆2,360Updated 3 weeks ago
- artoo.js - the client-side scraping companion.☆1,102Updated 3 years ago
- Robust RSS, Atom, and RDF feed parsing in Node.js☆1,971Updated last year
- A simple natural language tool written for NodeJS☆388Updated 6 years ago
- plugin to extract keywords and key-phrases☆327Updated 2 weeks ago
- Flexible event driven crawler for node.☆2,140Updated 3 years ago
- Web scraping library made by the Phantombuster team. Modern, simple & works on all websites. (Deprecated)☆501Updated 4 years ago
- A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and con…☆376Updated last year
- Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more.☆2,333Updated 2 weeks ago
- 🔮 A Node.js scraper for humans.☆4,010Updated last month
- Work in progress transmit from Google Code☆1,110Updated 6 years ago
- 🔐 Free, automated HTTPS for NodeJS made easy.☆1,181Updated 2 years ago
- Famous sorting algorithms based on vote popularity and time implemented for nodejs☆374Updated 6 years ago
- Highly scalable Node.js scraping framework for mobsters☆298Updated 2 years ago
- A Node.js module to search and scrape Google.☆454Updated 6 years ago
- An exercise in unsupervised machine learning: Extract Article's Text in HTml documents.☆434Updated 8 months ago
- Part-of-speech utilities for node.js based on the WordNet database.☆475Updated last year
- Naive-Bayes Classifier for node.js☆561Updated 3 years ago
- Machine-learning for Node.js☆1,054Updated 3 months ago
- Easy website screenshots in Node.js☆2,123Updated 5 years ago
- Artisanal inbound emails for every web app☆1,952Updated 3 years ago
- Tiny and blazing-fast fuzzy search in JavaScript☆2,713Updated last year
- A node module for Google's Universal Analytics and Measurement Protocol☆961Updated last year
- Run PhantomJS from Node☆1,453Updated 4 years ago
- Natural-language event parser for Javascript☆538Updated last year