luin / readabilityLinks
📚 Turn any web page into a clean view
☆2,522Updated 4 years ago
Alternatives and similar repositories for readability
Users that are interested in readability are comparing it to the libraries listed below
Sorting:
- Automatically extract body content (and other cool stuff) from an html document☆2,161Updated 2 years ago
- A standalone version of the readability lib☆10,687Updated 3 weeks ago
- Work in progress transmit from Google Code☆1,126Updated 7 years ago
- 📜 Extract meaningful content from the chaos of a web page☆5,750Updated last year
- Robust RSS, Atom, and RDF feed parsing in Node.js☆1,979Updated 2 years ago
- fast python port of arc90's readability tool, updated to match latest readability.js!☆2,871Updated 7 months ago
- Distills the DOM☆665Updated 4 years ago
- To extract main article from given URL with Node.js☆1,846Updated 3 months ago
- Just the facts -- web page content extraction☆1,276Updated 5 months ago
- Flexible event driven crawler for node.☆2,134Updated 4 years ago
- A complete and versatile web scraper.☆3,719Updated 5 years ago
- Scrape/Crawl article from any site automatically. Make any web page readable, no matter Chinese or English.☆346Updated 7 years ago
- A persistent, network resilient, full text search library for the browser and Node.js☆1,420Updated 8 months ago
- node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!☆1,688Updated 3 years ago
- Natural language detection☆4,348Updated last year
- natural language processor powered by plugins part of the @unifiedjs collective☆2,420Updated 10 months ago
- Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more.☆2,590Updated last week
- Node module that summarizes text using a naive summarization algorithm☆770Updated last year
- Distributed crawler powered by Headless Chrome☆5,673Updated 2 years ago
- A chrome extension to record your actions into a nightmare or puppeteer script☆2,763Updated last year
- Web Crawler/Spider for NodeJS + server-side jQuery ;-)☆6,785Updated 6 months ago
- Html Content / Article Extractor, web scrapping lib in Python☆4,051Updated 3 years ago
- HTML parsing/serialization toolset for Node.js. WHATWG HTML Living Standard (aka HTML5)-compliant.☆3,850Updated this week
- The fast & forgiving HTML and XML parser☆4,736Updated this week
- DEPRECATED - A framework for extracting meaning from web pages☆1,971Updated 3 weeks ago
- oEmbed proxy. Supports over 1800 domains via custom parsers, oEmbed, Twitter Cards and Open Graph☆1,602Updated last week
- Generate EPUB books from HTML with simple API in Node.js.☆454Updated 2 years ago
- Chrome Debugging Protocol interface for Node.js☆4,467Updated 9 months ago
- enjoy live editing (+markdown)☆4,815Updated 7 years ago
- Video player built using electron and node.js☆2,004Updated 6 years ago