webrecorder/specs

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/webrecorder/specs)

webrecorder / specs

Specifications developed and maintained by the Webrecorder community.

☆142

Alternatives and similar repositories for specs

Users that are interested in specs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

webrecorder / browsertrix-behaviors
View on GitHub
Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.
☆58Updated this week
webrecorder / py-wacz
View on GitHub
☆61Apr 11, 2024Updated 2 years ago
webrecorder / pywb-remote-browsers
View on GitHub
Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives
☆16Jun 10, 2021Updated 5 years ago
UAlbanyArchives / mailbagit
View on GitHub
A tool for creating and managing Mailbags, a package for preserving email using multiple preservation formats
☆52Nov 24, 2025Updated 7 months ago
nla / outbackcdx
View on GitHub
Web archive index server based on RocksDB
☆43Jul 9, 2026Updated last week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
webrecorder / replayweb.page
View on GitHub
Serverless replay of web archives directly in the browser
☆965Jul 13, 2026Updated last week
ukwa / ukwa-pywb
View on GitHub
☆11Nov 21, 2025Updated 8 months ago
webrecorder / browsertrix-crawler
View on GitHub
Run a high-fidelity browser-based web archiving crawler in a single Docker container
☆1,088Updated this week
WASAPI-Community / data-transfer-apis
View on GitHub
WASAPI data transfer APIs
☆50Apr 23, 2022Updated 4 years ago
unt-libraries / py-wasapi-client
View on GitHub
A client for the Archive-It And Webrecorder WASAPI Data Transfer API
☆16Oct 18, 2019Updated 6 years ago
webrecorder / behaviors
View on GitHub
Webrecorder Automated In-Page Behavior Framework
☆13Apr 21, 2021Updated 5 years ago
iipc / urlcanon
View on GitHub
url canonicalization library for python and java
☆43May 22, 2022Updated 4 years ago
webrecorder / warcit
View on GitHub
Convert Directories, Files and ZIP Files to Web Archives (WARC)
☆99Apr 22, 2025Updated last year
webrecorder / cdxj-indexer
View on GitHub
CDXJ Indexing of WARC/ARCs
☆35May 11, 2026Updated 2 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
maturban / WARCMerge
View on GitHub
Merging WARCs into a single WARC file
☆15Aug 29, 2014Updated 11 years ago
peterk / munin-indexer
View on GitHub
A social media open post web archiving tool
☆26Feb 4, 2026Updated 5 months ago
harvard-lil / waczerciser
View on GitHub
Create and edit WARC and WACZ files
☆29Dec 6, 2024Updated last year
atomotic / archiviiify
View on GitHub
Download digitized books from Internet Archive and view with IIIF, locally and offline.
☆39Apr 19, 2024Updated 2 years ago
webrecorder / warcio
View on GitHub
Streaming WARC/ARC library for fast web archive IO
☆461Jun 10, 2026Updated last month
webrecorder / wabac.js
View on GitHub
wabac.js - Web Archive Browsing Augmentation Client
☆127Jul 9, 2026Updated last week
webrecorder / warcio.js
View on GitHub
JS Streaming WARC IO optimized for Browser and Node
☆55Mar 25, 2026Updated 3 months ago
harvard-lil / warcgames
View on GitHub
Hacking challenges to learn web archive security.
☆35Jun 23, 2017Updated 9 years ago
harvard-lil / wacz-exhibitor
View on GitHub
Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.
☆44Nov 24, 2025Updated 7 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Rhizome-Conifer / conifer-deploy
View on GitHub
Conifer setup and deployment via Ansible
☆12Jun 15, 2020Updated 6 years ago
webis-de / wasp
View on GitHub
☆28Jun 30, 2026Updated 3 weeks ago
webrecorder / browsertrix
View on GitHub
Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more …
☆442Updated this week
webrecorder / markdown-to-respec
View on GitHub
A Github Action for turning Markdown into ReSpec HTML
☆16Jun 6, 2024Updated 2 years ago
webrecorder / browsertrix-old
View on GitHub
Browsertrix: Containerized High-Fidelity Browser-Based Automated Crawling + Behavior System
☆87Feb 16, 2021Updated 5 years ago
webrecorder / archiveweb.page-site
View on GitHub
The ArchiveWeb.page Site
☆32May 28, 2026Updated last month
islandora-interest-groups / Islandora-IR-Interest-Group
View on GitHub
An IG focused on improving Islandora as an IR platform
☆13Jan 19, 2023Updated 3 years ago
iipc / warc-specifications
View on GitHub
Centralised repository for WARC usage specifications.
☆129Apr 4, 2026Updated 3 months ago
internetarchive / arch
View on GitHub
Web application for distributed compute analysis of Archive-It web archive collections.
☆20Mar 24, 2026Updated 3 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
internetarchive / trough
View on GitHub
Trough: Big data, small databases.
☆43Jul 25, 2024Updated last year
webrecorder / pywb
View on GitHub
Core Python Web Archiving Toolkit for replay and recording of web archives
☆1,682Apr 10, 2026Updated 3 months ago
esmero / archipelago-documentation
View on GitHub
Archipelago Commons' ever evolving Documentation Repository
☆26Jun 23, 2026Updated 3 weeks ago
webrecorder / archiveweb.page
View on GitHub
A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!
☆1,525Jul 9, 2026Updated last week
sul-dlss / marctable
View on GitHub
A command line utility for converting MARC to CSV (and Parquet, etc)
☆29Apr 20, 2026Updated 3 months ago
NationalLibraryOfNorway / warchaeology
View on GitHub
Command line tool for digging into WARC files
☆50Updated this week
ikreymer / webarchive-indexing
View on GitHub
Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.
☆46Dec 4, 2017Updated 8 years ago