luin / readabilityLinks
📚 Turn any web page into a clean view
☆2,523Updated 4 years ago
Alternatives and similar repositories for readability
Users that are interested in readability are comparing it to the libraries listed below
Sorting:
- Automatically extract body content (and other cool stuff) from an html document☆2,163Updated 2 years ago
- A standalone version of the readability lib☆10,866Updated 2 weeks ago
- 📜 Extract meaningful content from the chaos of a web page☆5,763Updated last year
- A copy of the original Arc90 repo with links to many of the current ports.☆241Updated last year
- Work in progress transmit from Google Code☆1,127Updated 8 years ago
- Robust RSS, Atom, and RDF feed parsing in Node.js☆1,979Updated 2 years ago
- fast python port of arc90's readability tool, updated to match latest readability.js!☆2,886Updated last week
- Flexible event driven crawler for node.☆2,135Updated 4 years ago
- A complete and versatile web scraper.☆3,720Updated 5 years ago
- Distills the DOM☆665Updated 4 years ago
- Generate EPUB books from HTML with simple API in Node.js.☆454Updated 2 years ago
- Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more.☆2,613Updated last week
- Just the facts -- web page content extraction☆1,280Updated 6 months ago
- node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!☆1,692Updated last month
- Distributed crawler powered by Headless Chrome☆5,721Updated 2 years ago
- Annotation tools for the web. Select text, images, or (nearly) anything else, and add your notes.☆2,750Updated 2 years ago
- A chrome extension to record your actions into a nightmare or puppeteer script☆2,765Updated last year
- Web Crawler/Spider for NodeJS + server-side jQuery ;-)☆6,785Updated 8 months ago
- A persistent, network resilient, full text search library for the browser and Node.js☆1,426Updated 10 months ago
- 📄 A command line tool to generate PDF from URL, HTML or Markdown files.☆1,262Updated 7 months ago
- oEmbed proxy. Supports over 1800 domains via custom parsers, oEmbed, Twitter Cards and Open Graph☆1,612Updated last week
- Getting started with Puppeteer and Chrome Headless for Web Scraping☆2,363Updated 5 years ago
- PhantomJS integration module for NodeJS☆3,530Updated 6 years ago
- A scriptable browser like PhantomJS, based on Firefox☆2,995Updated 2 years ago
- Advanced html to text converter☆1,686Updated 2 years ago
- The next web scraper. See through the <html> noise.☆5,908Updated 2 weeks ago
- Capture website screenshots☆9,752Updated 4 months ago
- DEPRECATED - A framework for extracting meaning from web pages☆1,968Updated 2 months ago
- A cross-browser JavaScript range and selection library.☆2,302Updated 2 weeks ago
- enjoy live editing (+markdown)☆4,813Updated 7 years ago