π¨ High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.
β201Sep 3, 2025Updated 9 months ago
Alternatives and similar repositories for scoop
Users that are interested in scoop are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.β43Nov 24, 2025Updated 6 months ago
- JavaScript module and CLI tool for working with web archive data using the WACZ format specification.β17Mar 11, 2025Updated last year
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.β58Jun 12, 2026Updated last week
- Create and edit WARC and WACZ filesβ29Dec 6, 2024Updated last year
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archivesβ16Jun 10, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Static Site Generator for Viewing Web Archives (in WACZ) formatβ29Jun 30, 2023Updated 2 years ago
- β60Apr 11, 2024Updated 2 years ago
- β11Nov 21, 2025Updated 6 months ago
- A tool for collection archival slivers of the web and web archivesβ19Jun 1, 2026Updated 2 weeks ago
- Run a high-fidelity browser-based web archiving crawler in a single Docker containerβ1,058Updated this week
- Command line tool for digging into WARC filesβ49Jun 10, 2026Updated last week
- wabac.js - Web Archive Browsing Augmentation Clientβ126Updated this week
- β17Apr 16, 2026Updated 2 months ago
- Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more β¦β421Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ArchiveWeb.page Express!β14Nov 1, 2024Updated last year
- Indelible linksβ512Jun 11, 2026Updated last week
- Serverless replay of web archives directly in the browserβ952Jun 9, 2026Updated last week
- π§© Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser enβ¦β20Jul 11, 2025Updated 11 months ago
- Snapshots a web page to get it as a static, self-contained HTML document.β301Sep 18, 2022Updated 3 years ago
- β17Feb 12, 2024Updated 2 years ago
- Web Archiving Courseβ23Mar 4, 2024Updated 2 years ago
- This is a metadata assessment tool to query spreadsheet-based digital collection metadata against lexicons of offensive and outdated termβ¦β18Jun 18, 2025Updated last year
- Python library for using the CaltechDATA APIβ12Updated this week
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- WARC + AI - Experimental Retrieval Augmented Generation Pipeline for Web Archive Collections.β272Feb 11, 2025Updated last year
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Archβ¦β21Feb 2, 2024Updated 2 years ago
- β10Dec 3, 2025Updated 6 months ago
- A Github Action for turning Markdown into ReSpec HTMLβ16Jun 6, 2024Updated 2 years ago
- Consortia Collaborating on a Platform for Usage Statisticsβ11Aug 7, 2025Updated 10 months ago
- Self hosting code for Recogito-Studioβ22Apr 13, 2026Updated 2 months ago
- Typesafe IIIF presentation v3 parsing without external dependenciesβ12May 20, 2026Updated 3 weeks ago
- Converts WARC files to static HTMLβ56Sep 18, 2025Updated 9 months ago
- This command line converts .webarchive file to resources embed .html fileβ22Mar 3, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A VUE IIIF viewerβ15Jun 5, 2026Updated 2 weeks ago
- React application for the Digital Public Library of America websiteβ32Updated this week
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.β46Dec 4, 2017Updated 8 years ago
- Python WSGI Middleware for adding HTTP/S proxy support to any WSGI Applicationβ24Oct 27, 2020Updated 5 years ago
- Webrecorders DevTools Protocol Automation Libraryβ18Oct 18, 2022Updated 3 years ago
- A IIIF static tile and manifest generator built using Python to generate IIIF tiled images and manifests. This application was put togetβ¦β10Updated this week
- An Awesome List for getting started with web archivingβ2,568Apr 27, 2026Updated last month