ageitgey / node-unfluffLinks
Automatically extract body content (and other cool stuff) from an html document
โ2,163Updated 2 years ago
Alternatives and similar repositories for node-unfluff
Users that are interested in node-unfluff are comparing it to the libraries listed below
Sorting:
- Node module that summarizes text using a naive summarization algorithmโ770Updated 2 weeks ago
- ๐ Turn any web page into a clean viewโ2,523Updated 4 years ago
- A complete and versatile web scraper.โ3,720Updated 5 years ago
- Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English.โ346Updated 7 years ago
- Robust RSS, Atom, and RDF feed parsing in Node.jsโ1,979Updated 2 years ago
- plugin to extract keywords and key-phrasesโ337Updated last year
- artoo.js - the client-side scraping companion.โ1,118Updated 4 years ago
- Flexible event driven crawler for node.โ2,135Updated 4 years ago
- natural language processor powered by plugins part of the @unifiedjs collectiveโ2,430Updated last year
- Part-of-speech utilities for node.js based on the WordNet database.โ477Updated 3 years ago
- A persistent, network resilient, full text search library for the browser and Node.jsโ1,426Updated 10 months ago
- Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more.โ2,613Updated last week
- A Node.js module to search and scrape Google.โ456Updated 7 years ago
- A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and conโฆโ382Updated 3 years ago
- A search server that can be installed with npmโ658Updated 5 months ago
- Naive-Bayes Classifier for node.jsโ563Updated 4 years ago
- Declarative DOM extraction expression evaluator. ๐จโโ๏ธโ690Updated 5 years ago
- Easy website screenshots in Node.jsโ2,119Updated 6 years ago
- The next web scraper. See through the <html> noise.โ5,908Updated 2 weeks ago
- Web scraping library made by the Phantombuster team. Modern, simple & works on all websites. (Deprecated)โ497Updated 5 years ago
- A dialogue engine for creating chat botsโ1,643Updated 6 years ago
- Famous sorting algorithms based on vote popularity and time implemented for nodejsโ375Updated 7 years ago
- Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.โ723Updated last year
- A chrome extension to record your actions into a nightmare or puppeteer scriptโ2,765Updated last year
- ๐ฎ A Node.js scraper for humans.โ4,070Updated 3 months ago
- Natural language detectionโ4,370Updated last year
- RSS feed generator for Node.โ1,040Updated 2 months ago
- A collaborative filtering based recommendation engine and NPM module built on top of Node.js and Redis. The engine uses the Jaccard coeffโฆโ815Updated 5 years ago
- node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!โ1,692Updated last month
- Work in progress transmit from Google Codeโ1,127Updated 8 years ago