postlight / parser
Extract meaningful content from the chaos of a web page
★5,660 · Updated 11 months ago
Alternatives and similar repositories for parser
Users that are interested in parser are comparing it to the libraries listed below
- A standalone version of the readability lib ★10,154 · Updated last week
- Turn any web page into a clean view ★2,518 · Updated 4 years ago
- A command-line tool to turn web pages into readable PDF, EPUB, HTML, or Markdown docs. ★4,444 · Updated 6 months ago
- To extract main article from given URL with Node.js ★1,721 · Updated last month
- Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more. ★2,510 · Updated this week
- Browser extension to curate, annotate, and discuss the most valuable content and ideas on the web. As individuals, teams and communities. ★4,538 · Updated 3 months ago
- Distributed crawler powered by Headless Chrome ★5,582 · Updated 2 years ago
- dn - offline full-text search and archiving for your Chromium-based browser. ★3,846 · Updated last month
- Fathom Lite. Simple, privacy-focused website analytics. Built with Golang & Preact. ★7,913 · Updated last year
- Zero is a web server to simplify web development. ★5,840 · Updated last year
- Easily and securely share files from the command line. A fully featured Firefox Send client. ★7,155 · Updated 4 months ago
- Unlimited Google Drive Storage by splitting binary files into base64 ★4,358 · Updated 3 years ago
- Simple bookmark manager built with Go ★10,620 · Updated this week
- CLI tool and library for saving complete web pages as a single HTML file ★13,768 · Updated 2 weeks ago
- A Beautiful Open Source RSS & Podcast App Powered by Getstream.io ★9,050 · Updated 3 years ago
- wallabag is a self hostable application for saving web pages: Save and classify articles. Read them later. Freely. ★11,615 · Updated this week
- The free Zapier/IFTTT alternative for developers to automate your workflows based on Github actions ★3,277 · Updated 2 weeks ago
- Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and mor… ★24,144 · Updated last month
- RSS-proxy allows you to create an RSS or ATOM feed of almost any website just by analyzing the static HTML structure. ★1,857 · Updated 5 months ago
- Chrome extension that records your browser interactions and generates a Playwright or Puppeteer script. ★15,136 · Updated 2 years ago
- The most intuitive Static Site CMS designed for SEO-optimized and privacy-focused websites. ★6,783 · Updated 2 weeks ago
- natural language processor powered by plugins part of the @unifiedjs collective ★2,408 · Updated 4 months ago
- A low-level browser automation framework built on top of the Web Extensions API standard. ★1,744 · Updated 6 years ago
- Faster subsequent page-loads by prefetching in-viewport links during idle time ★11,136 · Updated 2 weeks ago
- Make your site's pages instant in 1 minute and improve your conversion rate by 1% ★6,161 · Updated 5 months ago
- Deploy static websites in seconds - with HTTPS, a global CDN, and custom domains. ★1,740 · Updated last month
- A Headless Chrome rendering solution ★5,950 · Updated 2 years ago
- A copy of the original Arc90 repo with links to many of the current ports. ★231 · Updated last year
- Web Extension to save a faithful copy of an entire web page in a self-extracting ZIP file ★1,884 · Updated 9 months ago
- Getting started with Puppeteer and Chrome Headless for Web Scraping ★2,359 · Updated 4 years ago