Automatically extract body content (and other cool stuff) from an html document
☆2,163May 26, 2023Updated 2 years ago
Alternatives and similar repositories for node-unfluff
Users that are interested in node-unfluff are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ExtractContent for node.js☆15Feb 18, 2026Updated last month
- 📚 Turn any web page into a clean view☆2,524Apr 3, 2021Updated 5 years ago
- Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English.☆346Aug 1, 2018Updated 7 years ago
- A standalone version of the readability lib☆11,069Jan 21, 2026Updated 2 months ago
- Node module that summarizes text using a naive summarization algorithm☆770Apr 2, 2026Updated last week
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- The next web scraper. See through the <html> noise.☆5,903Feb 16, 2026Updated last month
- DigitalOcean API PHP 5.3+ library for Laravel 4☆19Dec 17, 2014Updated 11 years ago
- PhantomJS script to automate Application Cache manifest file generation☆64Feb 25, 2014Updated 12 years ago
- 📜 Extract meaningful content from the chaos of a web page☆5,779Jul 10, 2024Updated last year
- Emblem.js precompiler plugin for gulp☆10Mar 24, 2023Updated 3 years ago
- general natural language facilities for node☆10,872Feb 22, 2026Updated last month
- Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more.☆2,660Mar 20, 2026Updated 3 weeks ago
- node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!☆1,693Dec 15, 2025Updated 3 months ago
- VK.com API wrapper module for Node.js☆12Aug 8, 2013Updated 12 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Get Readable Content from any page. Based on Arc90's readability project using cheerio engine.☆639Aug 10, 2018Updated 7 years ago
- To extract main article from given URL with Node.js☆1,871Sep 4, 2025Updated 7 months ago
- Html Content / Article Extractor, web scrapping lib in Python☆4,073Mar 10, 2026Updated last month
- A base model for forms.☆95Oct 31, 2013Updated 12 years ago
- Just the facts -- web page content extraction☆1,276Jul 8, 2025Updated 9 months ago
- Robust RSS, Atom, and RDF feed parsing in Node.js☆1,979Mar 27, 2026Updated 2 weeks ago
- modest natural-language processing☆12,064Feb 25, 2026Updated last month
- ☆14Apr 30, 2014Updated 11 years ago
- Scrapes a remote page and creates a summary with statistics☆37Aug 24, 2014Updated 11 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Work in progress transmit from Google Code☆1,127Jan 3, 2018Updated 8 years ago
- Node library to extract keywords from text☆57Aug 2, 2015Updated 10 years ago
- A chrome extension that applies a stylesheet to Hacker News' frontpage.☆65Aug 8, 2014Updated 11 years ago
- Html Content / Article Extractor in Scala - open sourced from Gravity Labs☆1,530Apr 18, 2017Updated 8 years ago
- A node.js wrapper for Boilerpipe, an excellent Java library for boilerplate removal and fulltext extraction from HTML pages.☆52Jul 20, 2017Updated 8 years ago
- plugin to extract keywords and key-phrases☆338Oct 23, 2024Updated last year
- Machine-learning for Node.js☆1,052Apr 2, 2026Updated last week
- [UNMAINTAINED] Extract terms and keywords from a piece of text☆169Mar 12, 2014Updated 12 years ago
- DIG search and visualization user interface for the HT domain☆12Oct 2, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:☆15,024Mar 23, 2026Updated 2 weeks ago
- AFINN-based sentiment analysis for Node.js.☆2,676May 18, 2020Updated 5 years ago
- WKWebView for React Native (+ a few nice things)☆12May 23, 2016Updated 9 years ago
- 👀 A Chrome extension helps you track webpages effortlessly☆10Mar 1, 2020Updated 6 years ago
- A Laravel 4 boilerplate package for creating web apps.☆15Jul 2, 2014Updated 11 years ago
- Node-Webkit REPL☆17Jul 12, 2015Updated 10 years ago
- Instagram Service Provider for Laravel 4.☆32Jun 20, 2014Updated 11 years ago