harvard-lil / waczerciser
Create and edit WARC and WACZ files
☆23Dec 6, 2024Updated last year
Alternatives and similar repositories for waczerciser
Users who are interested in waczerciser are comparing it to the libraries listed below
- ☆56Apr 11, 2024Updated last year
- A tool for collecting archival slivers of the web and web archives☆17Feb 18, 2025Updated 11 months ago
- Python script to create CDX index files of WARC data☆16Sep 7, 2018Updated 7 years ago
- Web Archiving Course☆23Mar 4, 2024Updated last year
- ☆11Nov 21, 2025Updated 2 months ago
- ☆16Oct 2, 2025Updated 4 months ago
- A client for the Archive-It And Webrecorder WASAPI Data Transfer API☆16Oct 18, 2019Updated 6 years ago
- Tools for helping you work with web platform archive downloads.☆18Mar 27, 2020Updated 5 years ago
- ArchiveWeb.page Express!☆14Nov 1, 2024Updated last year
- A listing of world wide web archives, for humans and machines using Web Archive Manifest (WAM) yaml format☆54Dec 5, 2022Updated 3 years ago
- Automating description for Web Archives in ArchivesSpace using the Archive-It CDX and Partner Data APIs☆11Aug 10, 2018Updated 7 years ago
- Rails application for the Archives Unleashed Cloud.☆11Jun 30, 2021Updated 4 years ago
- ☆10Dec 3, 2025Updated 2 months ago
- 🐳 A Docker getting-started kit for new businesses trying to self-host their data! Includes vetted apps for team communication, office do…☆11Dec 12, 2025Updated 2 months ago
- A tool for creating and managing Mailbags, a package for preserving email using multiple preservation formats☆50Nov 24, 2025Updated 2 months ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆55Updated this week
- ☆16Apr 19, 2025Updated 9 months ago
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives☆16Jun 10, 2021Updated 4 years ago
- Converts WARC files to static HTML☆51Sep 18, 2025Updated 4 months ago
- GraphPass is a utility to filter networks and provide a default visualization output for Gephi or SigmaJS.☆17Nov 14, 2020Updated 5 years ago
- Browsertrix: Containerized High-Fidelity Browser-Based Automated Crawling + Behavior System☆87Feb 16, 2021Updated 4 years ago
- Uses GitHub API to help repos switch from "master" to "main"☆15Jul 10, 2020Updated 5 years ago
- A Lit web-component for viewing a Whisper JSON transcript file☆14Dec 3, 2024Updated last year
- Command line tool to convert a file in the WARC format to a file in the ZIM format☆75Jan 21, 2026Updated 3 weeks ago
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.☆39Nov 24, 2025Updated 2 months ago
- url canonicalization library for python and java☆39May 22, 2022Updated 3 years ago
- ReproZip for the Preservation of Web Applications☆17May 6, 2024Updated last year
- Web archive index server based on RocksDB☆38Feb 1, 2026Updated last week
- 🍨 High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.☆187Sep 3, 2025Updated 5 months ago
- Islandora Drush module for performing Create, Read, Update, and Delete operations on datastreams.☆15May 9, 2022Updated 3 years ago
- Web application for distributed compute analysis of Archive-It web archive collections.☆20Oct 9, 2025Updated 4 months ago
- ARK minter, binder, resolver☆23Dec 11, 2025Updated 2 months ago
- An IG focused on improving Islandora as an IR platform☆13Jan 19, 2023Updated 3 years ago
- Mounts WARC files on Windows☆16Apr 20, 2019Updated 6 years ago
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Arch…☆19Feb 2, 2024Updated 2 years ago
- An in-development framework for managing data migrations from previous versions to 4.x.☆13Apr 24, 2025Updated 9 months ago
- Flask web application that runs the inlibraries.com website☆17Dec 2, 2016Updated 9 years ago
- (Note: This repository is obsolete, please see the new Browsertrix webrecorder/browsertrix) Browser-Based On-Demand Web Archiving Automat…☆38Apr 23, 2019Updated 6 years ago
- Simultaneously the simplest and most powerful Argon2 implementation in Python☆21May 15, 2025Updated 8 months ago