postlight / parser
Extract meaningful content from the chaos of a web page
★ 5,711 · Updated last year
Alternatives and similar repositories for parser
Users who are interested in parser are comparing it to the libraries listed below.
- A standalone version of the readability lib ★ 10,465 · Updated last week
- Turn any web page into a clean view ★ 2,517 · Updated 4 years ago
- Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more. ★ 2,570 · Updated 2 weeks ago
- Browser extension to curate, annotate, and discuss the most valuable content and ideas on the web. As individuals, teams and communities. ★ 4,565 · Updated 6 months ago
- A command-line tool to turn web pages into readable PDF, EPUB, HTML, or Markdown docs. ★ 4,495 · Updated last month
- To extract main article from given URL with Node.js ★ 1,750 · Updated last month
- A nice place to read on the web. ★ 3,677 · Updated last week
- A flexible event/agent & automation system with lots of bees ★ 6,444 · Updated 2 years ago
- Automatically extract body content (and other cool stuff) from an HTML document ★ 2,161 · Updated 2 years ago
- Next-generation full-text search library for Browser and Node.js ★ 13,359 · Updated last week
- A Beautiful Open Source RSS & Podcast App Powered by Getstream.io ★ 9,227 · Updated 3 years ago
- Distributed crawler powered by Headless Chrome ★ 5,595 · Updated 2 years ago
- Fathom Lite. Simple, privacy-focused website analytics. Built with Golang & Preact. ★ 7,933 · Updated last year
- The free Zapier/IFTTT alternative for developers to automate your workflows based on GitHub actions ★ 3,296 · Updated 3 months ago
- Simple bookmark manager built with Go ★ 10,931 · Updated this week
- An HTML to Markdown converter written in JavaScript ★ 10,339 · Updated last month
- wallabag is a self-hostable application for saving web pages: Save and classify articles. Read them later. Freely. ★ 12,085 · Updated this week
- A low-level browser automation framework built on top of the Web Extensions API standard. ★ 1,755 · Updated 7 years ago
- A bit like Solr, but much smaller and not as bright ★ 9,146 · Updated last year
- 💾 dn - offline full-text search and archiving for your Chromium-based browser. ★ 3,854 · Updated 4 months ago
- A self-hosted, anti-social RSS reader. ★ 4,063 · Updated this week
- Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and mor… ★ 25,111 · Updated 4 months ago
- Based on lunr.js, but more flexible and customized. ★ 2,074 · Updated 2 years ago
- Robust RSS, Atom, and RDF feed parsing in Node.js ★ 1,978 · Updated last year
- HTTP-based JSON storage. ★ 2,488 · Updated 2 years ago
- Deploy infinitely scalable serverless apps, APIs, and sites in seconds to AWS. ★ 8,814 · Updated last year
- A framework for extracting meaning from web pages ★ 1,971 · Updated last year
- Impossibly fast web search, made for static sites. ★ 2,752 · Updated 2 years ago
- 📸 Polaroid for your code ★ 6,846 · Updated 3 years ago
- A fast, bloat-free comments platform (GitHub mirror) ★ 3,776 · Updated 2 years ago