Automatically extract body content (and other cool stuff) from an html document
☆2,164May 26, 2023Updated 2 years ago
Alternatives and similar repositories for node-unfluff
Users that are interested in node-unfluff are comparing it to the libraries listed below
Sorting:
- ExtractContent for node.js☆15Feb 18, 2026Updated last week
- 📚 Turn any web page into a clean view☆2,523Apr 3, 2021Updated 4 years ago
- The next web scraper. See through the <html> noise.☆5,906Feb 16, 2026Updated last week
- Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English.☆346Aug 1, 2018Updated 7 years ago
- A standalone version of the readability lib☆10,939Jan 21, 2026Updated last month
- Emblem.js precompiler plugin for gulp☆10Mar 24, 2023Updated 2 years ago
- DigitalOcean API PHP 5.3+ library for Laravel 4☆19Dec 17, 2014Updated 11 years ago
- PhantomJS script to automate Application Cache manifest file generation☆64Feb 25, 2014Updated 12 years ago
- Node module that summarizes text using a naive summarization algorithm☆770Updated this week
- 📜 Extract meaningful content from the chaos of a web page☆5,775Jul 10, 2024Updated last year
- general natural language facilities for node☆10,864Feb 22, 2026Updated last week
- VK.com API wrapper module for Node.js☆12Aug 8, 2013Updated 12 years ago
- Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more.☆2,626Feb 17, 2026Updated last week
- node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!☆1,692Dec 15, 2025Updated 2 months ago
- A base model for forms.☆95Oct 31, 2013Updated 12 years ago
- A chrome extension that applies a stylesheet to Hacker News' frontpage.☆65Aug 8, 2014Updated 11 years ago
- Vue-animate provides an easy way to use beautiful animations for your page. Ideal for hero style landing pages☆10Jun 5, 2017Updated 8 years ago
- Scrapes a remote page and creates a summary with statistics☆37Aug 24, 2014Updated 11 years ago
- ☆14Apr 30, 2014Updated 11 years ago
- Instagram Service Provider for Laravel 4.☆32Jun 20, 2014Updated 11 years ago
- Robust RSS, Atom, and RDF feed parsing in Node.js☆1,978Oct 20, 2023Updated 2 years ago
- To extract main article from given URL with Node.js☆1,868Sep 4, 2025Updated 5 months ago
- DIG search and visualization user interface for the HT domain☆12Oct 2, 2017Updated 8 years ago
- modest natural-language processing☆12,040Updated this week
- Use scrapy with a list of proxies generated from proxynova.com☆39Jan 3, 2013Updated 13 years ago
- Site provisioning 3PC for MODx Revolution☆26May 9, 2013Updated 12 years ago
- A Laravel 4 boilerplate package for creating web apps.☆15Jul 2, 2014Updated 11 years ago
- 👀 A Chrome extension helps you track webpages effortlessly☆10Mar 1, 2020Updated 6 years ago
- Html Content / Article Extractor, web scrapping lib in Python☆4,063Dec 26, 2021Updated 4 years ago
- A simple PHP script to get Twitch HLS Stream by a channel name☆10May 13, 2025Updated 9 months ago
- Just the facts -- web page content extraction☆1,280Jul 8, 2025Updated 7 months ago
- ☆12Mar 21, 2016Updated 9 years ago
- WKWebView for React Native (+ a few nice things)☆12May 23, 2016Updated 9 years ago
- Html Content / Article Extractor in Scala - open sourced from Gravity Labs☆1,530Apr 18, 2017Updated 8 years ago
- Node-Webkit REPL☆17Jul 12, 2015Updated 10 years ago
- plugin to extract keywords and key-phrases☆338Oct 23, 2024Updated last year
- Discover, analyze and present data from the web and mobile in meaninful ways☆83Jul 16, 2013Updated 12 years ago
- newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:☆14,996Dec 6, 2025Updated 2 months ago
- Scrape Facebook profiles without the Open Graph API☆10Nov 6, 2016Updated 9 years ago