zytedata / web-snapLinks
Create "perfect" snapshots of web pages
β33Updated 2 months ago
Alternatives and similar repositories for web-snap
Users that are interested in web-snap are comparing it to the libraries listed below
Sorting:
- π§© Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser enβ¦β18Updated 3 months ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.β50Updated last week
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Archβ¦β19Updated last year
- A curated list of well-known URIs, resources, guides and tools (RFC 5785)β78Updated last year
- π¨ High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.β177Updated last month
- Tool to index and serve HTML files. Powered by Datasette.β107Updated 3 years ago
- Awesome links related to RSS, ATOM, and Syndication formats.β60Updated last year
- A framework for quick web archiving; canonical repository: https://gitea.arpa.li/JustAnotherArchivist/qwarcβ29Updated 4 years ago
- A self-hosted bookmark database with full-text page content searchβ95Updated 4 months ago
- Coldbrew is Python compiled into JavaScript using Emscripten.β31Updated 2 years ago
- This is the HeadQuarters of my digital info. HPI library got me inspired and I'm trying to play with the idea on a smaller scale for myseβ¦β21Updated last year
- Run a Personal VPN with global exit nodes and proxy via Tailscale IPNβ43Updated 6 months ago
- π‘οΈπ§ Protect e-mails against spam and scraping botsβ32Updated 9 months ago
- The Toolkit API, app, and browser extension. Start preserving now.β47Updated 3 weeks ago
- Create a static website with Fly - HTML from the exampleβ21Updated last year
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.β130Updated last month
- Create a SQLite database containing data pulled from Hacker Newsβ53Updated 2 years ago
- A dockerized, queued high fidelity web archiver based on Squidwarcβ61Updated last year
- Nodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Supportβ¦β112Updated 2 years ago
- Embed any reddit post onto your website!β24Updated 4 years ago
- A single tab web browser built with puppeteer. Also, no client-side JS. Viewport is streamed with MJPEG. For realz.β56Updated 2 years ago
- Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more β¦β342Updated this week
- A browser extension that can be installed by volunteers to participate in mwmbl distributed crawling.β26Updated 3 months ago
- Lightweight JavaScript library to interact with Chromium-based browsers via the Chrome DevTools Protocolβ25Updated last year
- Convert an online sitemap to Atom, RSS and JSON feedsβ61Updated 2 years ago
- Securely collect browsing history over browsers.β94Updated 4 years ago
- Export your Github activity: events, repositories, stars, etc.β52Updated 3 months ago
- Collection of useful utilities for working with Google Cloud Storage.β14Updated 2 years ago
- A list of things related to software, literature, and other content for π£ Mementoβ99Updated last year
- Pegao is a community about lists of links on topics of interest.β13Updated 2 years ago