ageitgey / node-unfluff
Automatically extract body content (and other cool stuff) from an html document
☆2,158 Updated 2 years ago
Alternatives and similar repositories for node-unfluff
Users who are interested in node-unfluff are comparing it to the libraries listed below.
- Node module that summarizes text using a naive summarization algorithm ☆770 Updated 11 months ago
- A complete and versatile web scraper. ☆3,718 Updated 4 years ago
- plugin to extract keywords and key-phrases ☆334 Updated 10 months ago
- Robust RSS, Atom, and RDF feed parsing in Node.js ☆1,976 Updated last year
- natural language processor powered by plugins part of the @unifiedjs collective ☆2,414 Updated 7 months ago
- 📚 Turn any web page into a clean view ☆2,518 Updated 4 years ago
- Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English. ☆344 Updated 7 years ago
- Flexible event driven crawler for node. ☆2,138 Updated 4 years ago
- Part-of-speech utilities for node.js based on the WordNet database. ☆475 Updated 2 years ago
- artoo.js - the client-side scraping companion. ☆1,115 Updated 4 years ago
- Naive-Bayes Classifier for node.js ☆563 Updated 3 years ago
- A persistent, network resilient, full text search library for the browser and Node.js ☆1,419 Updated 5 months ago
- A chrome extension to record your actions into a nightmare or puppeteer script ☆2,767 Updated 10 months ago
- Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript. ☆717 Updated last year
- 🔮 A Node.js scraper for humans. ☆4,065 Updated 2 months ago
- Easy website screenshots in Node.js ☆2,117 Updated 6 years ago
- The next web scraper. See through the <html> noise. ☆5,901 Updated last month
- node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more! ☆1,687 Updated 2 years ago
- A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and con… ☆380 Updated 2 years ago
- A Node.js module to search and scrape Google. ☆456 Updated 6 years ago
- A search server that can be installed with npm ☆656 Updated last month
- Famous sorting algorithms based on vote popularity and time implemented for nodejs ☆375 Updated 7 years ago
- Web scraping library made by the Phantombuster team. Modern, simple & works on all websites. (Deprecated) ☆500 Updated 5 years ago
- Run PhantomJS from Node ☆1,453 Updated 5 years ago
- A dialogue engine for creating chat bots ☆1,647 Updated 6 years ago
- Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more. ☆2,563 Updated this week
- Web scraper for NodeJS ☆4,115 Updated last year
- fasttag part of speech tagger javascript implementation ☆279 Updated 5 years ago
- A simple natural language tool written for NodeJS ☆389 Updated 7 years ago
- A framework for extracting meaning from web pages ☆1,971 Updated last year