luin / readability
📚 Turn any web page into a clean view
☆2,520 Updated 4 years ago
Alternatives and similar repositories for readability
Users who are interested in readability are comparing it to the libraries listed below.
- Automatically extract body content (and other cool stuff) from an html document ☆2,156 Updated 2 years ago
- A standalone version of the readability lib ☆10,350 Updated 2 weeks ago
- 📜 Extract meaningful content from the chaos of a web page ☆5,700 Updated last year
- Work in progress transmit from Google Code ☆1,122 Updated 7 years ago
- Robust RSS, Atom, and RDF feed parsing in Node.js ☆1,975 Updated last year
- fast python port of arc90's readability tool, updated to match latest readability.js! ☆2,839 Updated 3 months ago
- Distills the DOM ☆663 Updated 3 years ago
- To extract main article from given URL with Node.js ☆1,736 Updated 3 months ago
- Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more. ☆2,548 Updated 2 weeks ago
- Flexible event driven crawler for node. ☆2,140 Updated 4 years ago
- A complete and versatile web scraper. ☆3,721 Updated 4 years ago
- Just the facts -- web page content extraction ☆1,271 Updated last month
- oEmbed proxy. Supports over 1800 domains via custom parsers, oEmbed, Twitter Cards and Open Graph ☆1,588 Updated this week
- 🔮 A Node.js scraper for humans. ☆4,064 Updated last month
- A persistent, network resilient, full text search library for the browser and Node.js ☆1,418 Updated 4 months ago
- A chrome extension to record your actions into a nightmare or puppeteer script ☆2,766 Updated 9 months ago
- RSS feed generator for Node. ☆1,034 Updated 5 months ago
- Distributed crawler powered by Headless Chrome ☆5,589 Updated 2 years ago
- node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more! ☆1,684 Updated 2 years ago
- Based on lunr.js, but more flexible and customized. ☆2,075 Updated 2 years ago
- Advanced html to text converter ☆1,667 Updated last year
- Generate EPUB books from HTML with simple API in Node.js. ☆449 Updated 2 years ago
- natural language processor powered by plugins part of the @unifiedjs collective ☆2,411 Updated 6 months ago
- A framework for extracting meaning from web pages ☆1,972 Updated last year
- Easy website screenshots in Node.js ☆2,117 Updated 6 years ago
- Run PhantomJS from Node ☆1,453 Updated 5 years ago
- The next web scraper. See through the <html> noise. ☆5,899 Updated 2 weeks ago
- A scriptable browser like PhantomJS, based on Firefox ☆2,995 Updated 2 years ago
- Natural language detection ☆4,303 Updated last year
- Html Content / Article Extractor, web scraping lib in Python ☆4,047 Updated 3 years ago