ArchiveBox / abx-spec-behaviorsLinks
π§© Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser environments, puppeteer, playwright, extensions, AI tools, and many other contexts with minimal adjustment.
β18Updated 3 months ago
Alternatives and similar repositories for abx-spec-behaviors
Users that are interested in abx-spec-behaviors are comparing it to the libraries listed below
Sorting:
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.β50Updated 2 weeks ago
- π¨ High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.β177Updated last month
- A list of things related to software, literature, and other content for π£ Mementoβ102Updated last year
- Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewingβ¦β97Updated last week
- A Memento Aggregator CLI and Server in Goβ69Updated 7 months ago
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Archβ¦β19Updated last year
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each pageβ¦β40Updated last year
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user acβ¦β55Updated 2 months ago
- Static Site Generator for Viewing Web Archives (in WACZ) formatβ28Updated 2 years ago
- Converts WARC files to static HTMLβ49Updated last month
- π A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivityβ97Updated 7 years ago
- Export your Github activity: events, repositories, stars, etc.β52Updated 3 months ago
- A social media open post web archiving toolβ27Updated 3 weeks ago
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from Aβ¦β18Updated last year
- Tool to index and serve HTML files. Powered by Datasette.β107Updated 3 years ago
- β¬οΈ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). π β¦β86Updated 2 months ago
- A dockerized, queued high fidelity web archiver based on Squidwarcβ61Updated last year
- This is the HeadQuarters of my digital info. HPI library got me inspired and I'm trying to play with the idea on a smaller scale for myseβ¦β21Updated last year
- CDXJ Indexing of WARC/ARCsβ29Updated 10 months ago
- A tool for collection archival slivers of the web and web archivesβ16Updated 8 months ago
- Save data from Mastodon to a SQLite databaseβ28Updated 3 months ago
- Command line tool for digging into WARC filesβ46Updated last month
- backup and parse your browser history databases (chrome, firefox, safari, and other chrome/firefox derivatives)β146Updated last month
- Save pages to the Wayback Machine as part of your CI/CD pipelineβ31Updated this week
- Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more β¦β344Updated last week
- linkbak is a web page archiver : it reads a list of links and dumps the corresponding pages in HTML and PDF.β13Updated 2 years ago
- Comparing warc filesβ17Updated 6 years ago
- A simple Python wrapper and command-line interface for archive.orgβs "Save Page Now" capturing serviceβ186Updated last year
- β13Updated 3 weeks ago
- wabac.js - Web Archive Browsing Augmentation Clientβ114Updated 2 weeks ago