ageitgey / node-unfluff
Automatically extract body content (and other cool stuff) from an html document
★2,160 Updated 2 years ago
Alternatives and similar repositories for node-unfluff
Users who are interested in node-unfluff are comparing it to the libraries listed below.
- Turn any web page into a clean view ★2,517 Updated 4 years ago
- Node module that summarizes text using a naive summarization algorithm ★769 Updated 8 months ago
- Scrape/Crawl article from any site automatically. Make any web page readable, whether Chinese or English. ★344 Updated 6 years ago
- A complete and versatile web scraper. ★3,719 Updated 4 years ago
- Natural language processor powered by plugins, part of the @unifiedjs collective ★2,408 Updated 4 months ago
- Web scraper for NodeJS ★4,112 Updated last year
- A Node.js scraper for humans. ★4,055 Updated last month
- The next web scraper. See through the <html> noise. ★5,901 Updated last week
- Flexible event driven crawler for node. ★2,142 Updated 4 years ago
- Part-of-speech utilities for node.js based on the WordNet database. ★476 Updated 2 years ago
- A dialogue engine for creating chat bots ★1,646 Updated 6 years ago
- artoo.js - the client-side scraping companion. ★1,113 Updated 4 years ago
- Robust RSS, Atom, and RDF feed parsing in Node.js ★1,973 Updated last year
- Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more. ★2,507 Updated this week
- A framework for extracting meaning from web pages ★1,972 Updated last year
- A node server and module which allows for cross-domain page scraping on web documents with JSONP or POST. ★746 Updated last year
- A persistent, network resilient, full text search library for the browser and Node.js ★1,413 Updated 2 months ago
- Machine-learning for Node.js ★1,054 Updated 4 months ago
- Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript. ★716 Updated last year
- A chrome extension to record your actions into a nightmare or puppeteer script ★2,767 Updated 7 months ago
- A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and con… ★379 Updated 2 years ago
- Natural-language event parser for Javascript ★553 Updated last year
- A Node.js module to search and scrape Google. ★454 Updated 6 years ago
- Naive-Bayes Classifier for node.js ★562 Updated 3 years ago
- plugin to extract keywords and key-phrases ★333 Updated 8 months ago
- Work in progress transmit from Google Code ★1,116 Updated 7 years ago
- A simple natural language tool written for NodeJS ★388 Updated 7 years ago
- node.js/express module to authenticate users without password ★1,951 Updated 5 years ago
- PhearJS - render dynamic Javascript webpages to JSON with PhantomJS ★327 Updated 7 years ago
- Run PhantomJS from Node ★1,454 Updated 5 years ago