π¨ High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.
β191Sep 3, 2025Updated 6 months ago
Alternatives and similar repositories for scoop
Users that are interested in scoop are comparing it to the libraries listed below
Sorting:
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.β42Nov 24, 2025Updated 3 months ago
- JavaScript module and CLI tool for working with web archive data using the WACZ format specification.β17Mar 11, 2025Updated 11 months ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.β55Feb 10, 2026Updated last month
- Create and edit WARC and WACZ filesβ24Dec 6, 2024Updated last year
- Static Site Generator for Viewing Web Archives (in WACZ) formatβ30Jun 30, 2023Updated 2 years ago
- β17Oct 2, 2025Updated 5 months ago
- β59Apr 11, 2024Updated last year
- Command line tool for digging into WARC filesβ51Feb 27, 2026Updated last week
- A tool for collection archival slivers of the web and web archivesβ17Feb 18, 2025Updated last year
- wabac.js - Web Archive Browsing Augmentation Clientβ123Updated this week
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archivesβ16Jun 10, 2021Updated 4 years ago
- β11Nov 21, 2025Updated 3 months ago
- Run a high-fidelity browser-based web archiving crawler in a single Docker containerβ986Mar 3, 2026Updated last week
- Web Archiving Courseβ23Mar 4, 2024Updated 2 years ago
- Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more β¦β390Updated this week
- This is a metadata assessment tool to query spreadsheet-based digital collection metadata against lexicons of offensive and outdated termβ¦β18Jun 18, 2025Updated 8 months ago
- Indelible linksβ501Feb 25, 2026Updated last week
- Serverless replay of web archives directly in the browserβ916Updated this week
- ArchiveWeb.page Express!β14Nov 1, 2024Updated last year
- Self hosting code for Recogito-Studioβ20Updated this week
- A IIIF static tile and manifest generator built using Python to generate IIIF tiled images and manifests. This application was put togetβ¦β10Mar 2, 2026Updated last week
- Snapshots a web page to get it as a static, self-contained HTML document.β300Sep 18, 2022Updated 3 years ago
- A VUE IIIF viewerβ14Dec 14, 2025Updated 2 months ago
- Typesafe IIIF presentation v3 parsing without external dependenciesβ12Dec 16, 2025Updated 2 months ago
- Converts WARC files to static HTMLβ51Sep 18, 2025Updated 5 months ago
- Python library for using the CaltechDATA APIβ12Feb 27, 2026Updated last week
- The official Internet Archive IIIF serviceβ26Feb 3, 2026Updated last month
- β17Mar 31, 2025Updated 11 months ago
- Comparing warc filesβ17Feb 21, 2019Updated 7 years ago
- A Github Action for turning Markdown into ReSpec HTMLβ15Jun 6, 2024Updated last year
- Browsertrix: Containerized High-Fidelity Browser-Based Automated Crawling + Behavior Systemβ87Feb 16, 2021Updated 5 years ago
- A listing of world wide web archives, for humans and machines using Web Archive Manifest (WAM) yaml formatβ54Dec 5, 2022Updated 3 years ago
- CDXJ Indexing of WARC/ARCsβ33Dec 10, 2024Updated last year
- An Awesome List for getting started with web archivingβ2,498Updated this week
- A tool for creating and managing Mailbags, a package for preserving email using multiple preservation formatsβ50Nov 24, 2025Updated 3 months ago
- A client for the Archive-It And Webrecorder WASAPI Data Transfer APIβ16Oct 18, 2019Updated 6 years ago
- A Memento Aggregator CLI and Server in Goβ77Mar 4, 2025Updated last year
- Library of Congress Labs, Artist in Residency program project. Speculative Annotation.β16Jul 28, 2021Updated 4 years ago
- π§© Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser enβ¦β19Jul 11, 2025Updated 7 months ago