Automatically extract body content (and other cool stuff) from an html document
☆2,164May 26, 2023Updated 2 years ago
Alternatives and similar repositories for node-unfluff
Users that are interested in node-unfluff are comparing it to the libraries listed below
Sorting:
- ExtractContent for node.js☆15Feb 18, 2026Updated last month
- 📚 Turn any web page into a clean view☆2,521Apr 3, 2021Updated 4 years ago
- Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English.☆346Aug 1, 2018Updated 7 years ago
- A standalone version of the readability lib☆11,002Jan 21, 2026Updated 2 months ago
- Node module that summarizes text using a naive summarization algorithm☆770Mar 1, 2026Updated 2 weeks ago
- The next web scraper. See through the <html> noise.☆5,906Feb 16, 2026Updated last month
- DigitalOcean API PHP 5.3+ library for Laravel 4☆19Dec 17, 2014Updated 11 years ago
- PhantomJS script to automate Application Cache manifest file generation☆64Feb 25, 2014Updated 12 years ago
- 📜 Extract meaningful content from the chaos of a web page☆5,778Jul 10, 2024Updated last year
- Emblem.js precompiler plugin for gulp☆10Mar 24, 2023Updated 2 years ago
- general natural language facilities for node☆10,875Feb 22, 2026Updated 3 weeks ago
- Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more.☆2,638Mar 7, 2026Updated 2 weeks ago
- node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!☆1,693Dec 15, 2025Updated 3 months ago
- VK.com API wrapper module for Node.js☆12Aug 8, 2013Updated 12 years ago
- Get Readable Content from any page. Based on Arc90's readability project using cheerio engine.☆641Aug 10, 2018Updated 7 years ago
- To extract main article from given URL with Node.js☆1,871Sep 4, 2025Updated 6 months ago
- Html Content / Article Extractor, web scrapping lib in Python☆4,070Mar 10, 2026Updated last week
- A base model for forms.☆95Oct 31, 2013Updated 12 years ago
- Just the facts -- web page content extraction☆1,279Jul 8, 2025Updated 8 months ago
- Robust RSS, Atom, and RDF feed parsing in Node.js☆1,978Updated this week
- modest natural-language processing☆12,052Feb 25, 2026Updated 3 weeks ago
- ☆14Apr 30, 2014Updated 11 years ago
- Scrapes a remote page and creates a summary with statistics☆37Aug 24, 2014Updated 11 years ago
- Work in progress transmit from Google Code☆1,127Jan 3, 2018Updated 8 years ago
- Node library to extract keywords from text☆57Aug 2, 2015Updated 10 years ago
- A chrome extension that applies a stylesheet to Hacker News' frontpage.☆65Aug 8, 2014Updated 11 years ago
- Html Content / Article Extractor in Scala - open sourced from Gravity Labs☆1,530Apr 18, 2017Updated 8 years ago
- A node.js wrapper for Boilerpipe, an excellent Java library for boilerplate removal and fulltext extraction from HTML pages.☆52Jul 20, 2017Updated 8 years ago
- plugin to extract keywords and key-phrases☆338Oct 23, 2024Updated last year
- Machine-learning for Node.js☆1,053Mar 14, 2026Updated last week
- [UNMAINTAINED] Extract terms and keywords from a piece of text☆169Mar 12, 2014Updated 12 years ago
- DIG search and visualization user interface for the HT domain☆12Oct 2, 2017Updated 8 years ago
- newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:☆15,010Dec 6, 2025Updated 3 months ago
- AFINN-based sentiment analysis for Node.js.☆2,674May 18, 2020Updated 5 years ago
- WKWebView for React Native (+ a few nice things)☆12May 23, 2016Updated 9 years ago
- 👀 A Chrome extension helps you track webpages effortlessly☆10Mar 1, 2020Updated 6 years ago
- A Laravel 4 boilerplate package for creating web apps.☆15Jul 2, 2014Updated 11 years ago
- Node-Webkit REPL☆17Jul 12, 2015Updated 10 years ago
- Instagram Service Provider for Laravel 4.☆32Jun 20, 2014Updated 11 years ago