webrecorder/browsertrix

Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!

☆442

Alternatives and similar repositories for browsertrix

Users who are interested in browsertrix are comparing it to the repositories listed below.

webrecorder / browsertrix-crawler
Run a high-fidelity browser-based web archiving crawler in a single Docker container
☆1,088 · Updated this week
webrecorder / replayweb.page
Serverless replay of web archives directly in the browser
☆965 · Jul 13, 2026 · Updated last week
NationalLibraryOfNorway / warchaeology
Command line tool for digging into WARC files
☆50 · Updated this week
webrecorder / archiveweb.page
A High-Fidelity Web Archiving Extension for Chrome and Chromium based browsers!
☆1,525 · Jul 9, 2026 · Updated last week
harvard-lil / scoop
🍨 High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.
☆205 · Sep 3, 2025 · Updated 10 months ago
webrecorder / py-wacz
☆61 · Apr 11, 2024 · Updated 2 years ago
ukwa / ukwa-pywb
☆11 · Nov 21, 2025 · Updated 8 months ago
webrecorder / pywb-remote-browsers
Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives
☆16 · Jun 10, 2021 · Updated 5 years ago
webrecorder / browsertrix-behaviors
Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.
☆58 · Updated this week
webrecorder / specs
Specifications developed and maintained by the Webrecorder community.
☆142 · Oct 16, 2025 · Updated 9 months ago
webrecorder / wabac.js
wabac.js - Web Archive Browsing Augmentation Client
☆127 · Jul 9, 2026 · Updated last week
webrecorder / markdown-to-respec
A GitHub Action for turning Markdown into ReSpec HTML
☆16 · Jun 6, 2024 · Updated 2 years ago
webrecorder / cdxj-indexer
CDXJ Indexing of WARC/ARCs
☆35 · May 11, 2026 · Updated 2 months ago
iipc / warcaroo
☆18 · Apr 29, 2026 · Updated 2 months ago
harvard-lil / waczerciser
Create and edit WARC and WACZ files
☆29 · Dec 6, 2024 · Updated last year
digipres / awesome-digital-preservation
Carefully curated list of awesome digital preservation resources.
☆139 · Aug 1, 2025 · Updated 11 months ago
chfoo / warcat-rs
Command-line tool and Rust library for handling Web ARChive (WARC) files
☆33 · Jun 2, 2025 · Updated last year
ArchiveBox / DigestBox
DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Arch…
☆21 · Feb 2, 2024 · Updated 2 years ago
webrecorder / public-web-archives
A listing of world wide web archives, for humans and machines using Web Archive Manifest (WAM) yaml format
☆55 · Dec 5, 2022 · Updated 3 years ago
webrecorder / warcit
Convert Directories, Files and ZIP Files to Web Archives (WARC)
☆99 · Apr 22, 2025 · Updated last year
lcnetdev / PREMIS
☆10 · Apr 26, 2026 · Updated 2 months ago
webrecorder / oembed.link
A Cloudflare Worker to render embeds on a single page using oEmbed
☆25 · Nov 17, 2022 · Updated 3 years ago
machawk1 / wail
Web Archiving Integration Layer: One-Click User Instigated Preservation
☆398 · Jun 19, 2026 · Updated last month
ArchiveTeam / grab-site
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
☆1,601 · May 23, 2025 · Updated last year
Own-Data-Privateer / hoardy-web
Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing…
☆131 · Jul 13, 2026 · Updated last week
BitCurator / bitcurator-nlp-gentm
Generate topic models from open text extracted from files in disk images
☆10 · Apr 11, 2023 · Updated 3 years ago
harvard-lil / warcbench
A tool for exploring, analyzing, transforming, recombining, and extracting data from WARC (Web ARChive) files.
☆22 · Jul 30, 2025 · Updated 11 months ago
openpreserve / ViPER
Dutch Digital Heritage Network virtual research environment set up and provisioning
☆18 · Updated this week
anjackson / sliver
A tool for collecting archival slivers of the web and web archives
☆19 · Jun 1, 2026 · Updated last month
ArchiveBox / abx-spec-behaviors
🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en…
☆20 · Jul 11, 2025 · Updated last year
harvard-lil / wacz-exhibitor
Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.
☆44 · Nov 24, 2025 · Updated 7 months ago
Rhizome-Conifer / conifer
Collect and revisit web pages.
☆1,542 · May 12, 2026 · Updated 2 months ago
wabarc / wayback
An archiving tool with an IM-style interface that prioritizes privacy and accessibility, integrated with various archival services includ…
☆2,219 · Updated this week
MaastrichtU-Library / omekas-docker
Dockerized development environment for Omeka S
☆14 · Jul 9, 2026 · Updated last week
webrecorder / web-archive-site-mirror
☆17 · Apr 16, 2026 · Updated 3 months ago
webrecorder / browsertrix-old
Browsertrix: Containerized High-Fidelity Browser-Based Automated Crawling + Behavior System
☆87 · Feb 16, 2021 · Updated 5 years ago
APTrust / dart
Create bags based on BagIt profiles and send them off into the ether (EasyStore is now DART)
☆62 · Updated this week
keeps / roda
RODA - Repository of Authentic Digital Objects
☆101 · Updated this week
harvard-lil / thread-keeper
(Experimental) High-fidelity capture of Twitter threads as sealed PDFs.
☆55 · Dec 4, 2023 · Updated 2 years ago