Specifications developed and maintained by the Webrecorder community.
☆140Oct 16, 2025Updated 4 months ago
Alternatives and similar repositories for specs
Users that are interested in specs are comparing it to the libraries listed below
Sorting:
- Web archive index server based on RocksDB☆38Updated this week
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆55Feb 10, 2026Updated 3 weeks ago
- A client for the Archive-It And Webrecorder WASAPI Data Transfer API☆16Oct 18, 2019Updated 6 years ago
- ☆58Apr 11, 2024Updated last year
- Serverless replay of web archives directly in the browser☆914Feb 20, 2026Updated last week
- CDXJ Indexing of WARC/ARCs☆33Dec 10, 2024Updated last year
- ArchiveWeb.page Express!☆14Nov 1, 2024Updated last year
- A prototype server to swarm multiple DATs for Webrecorder☆14Apr 27, 2019Updated 6 years ago
- WASAPI data transfer APIs☆48Apr 23, 2022Updated 3 years ago
- ☆11Nov 21, 2025Updated 3 months ago
- Webrecorder Automated In-Page Behavior Framework☆13Apr 21, 2021Updated 4 years ago
- Run a high-fidelity browser-based web archiving crawler in a single Docker container☆984Updated this week
- ☆16Oct 2, 2025Updated 5 months ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC)☆93Apr 22, 2025Updated 10 months ago
- A listing of world wide web archives, for humans and machines using Web Archive Manifest (WAM) yaml format☆54Dec 5, 2022Updated 3 years ago
- A Github Action for turning Markdown into ReSpec HTML☆15Jun 6, 2024Updated last year
- ☆27Oct 14, 2022Updated 3 years ago
- A social media open post web archiving tool☆26Feb 4, 2026Updated 3 weeks ago
- JavaScript module and CLI tool for working with web archive data using the WACZ format specification.☆17Mar 11, 2025Updated 11 months ago
- Streaming WARC/ARC library for fast web archive IO☆451Dec 10, 2024Updated last year
- Browsertrix: Containerized High-Fidelity Browser-Based Automated Crawling + Behavior System☆87Feb 16, 2021Updated 5 years ago
- A command line utility for listing and searching snapshots in web archives☆17Dec 21, 2023Updated 2 years ago
- Download digitized books from Internet Archive and view with IIIF, locally and offline.☆38Apr 19, 2024Updated last year
- Command line tool for digging into WARC files☆51Updated this week
- Centralised repository for WARC usage specifications.☆125Oct 12, 2025Updated 4 months ago
- The ArchiveWeb.page Site☆32Nov 7, 2025Updated 3 months ago
- Golang WARC (Web ARChive) Library☆30Aug 6, 2019Updated 6 years ago
- Web archiving using Google Chrome☆46Dec 30, 2019Updated 6 years ago
- A S3 hybrid storage interface for dat and hyperdrive☆13Jul 31, 2018Updated 7 years ago
- JS Streaming WARC IO optimized for Browser and Node☆53Feb 17, 2026Updated 2 weeks ago
- A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!☆1,414Feb 7, 2026Updated 3 weeks ago
- Wrapper for the Hypothes.is API☆19Aug 29, 2019Updated 6 years ago
- url canonicalization library for python and java☆39May 22, 2022Updated 3 years ago
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.☆39Nov 24, 2025Updated 3 months ago
- ☆16Apr 19, 2025Updated 10 months ago
- Core Python Web Archiving Toolkit for replay and recording of web archives☆1,627Jan 21, 2026Updated last month
- Trough: Big data, small databases.☆41Jul 25, 2024Updated last year
- GraphPass is a utility to filter networks and provide a default visualization output for Gephi or SigmaJS.☆17Nov 14, 2020Updated 5 years ago
- Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more …☆388Updated this week