webrecorder/browsertrix-old

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/webrecorder/browsertrix-old)

webrecorder / browsertrix-old

Browsertrix: Containerized High-Fidelity Browser-Based Automated Crawling + Behavior System

☆87

Alternatives and similar repositories for browsertrix-old

Users that are interested in browsertrix-old are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Rhizome-Conifer / conifer-deploy
View on GitHub
Conifer setup and deployment via Ansible
☆12Jun 15, 2020Updated 6 years ago
webrecorder / behaviors
View on GitHub
Webrecorder Automated In-Page Behavior Framework
☆13Apr 21, 2021Updated 5 years ago
harvard-lil / waczerciser
View on GitHub
Create and edit WARC and WACZ files
☆29Dec 6, 2024Updated last year
ikreymer / webarchive-indexing
View on GitHub
Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.
☆46Dec 4, 2017Updated 8 years ago
anjackson / sliver
View on GitHub
A tool for collection archival slivers of the web and web archives
☆19Jun 1, 2026Updated last month
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ukwa / ukwa-pywb
View on GitHub
☆11Nov 21, 2025Updated 8 months ago
N0taN3rd / Squidwarc
View on GitHub
Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
☆178May 19, 2020Updated 6 years ago
edsu / memento-cli
View on GitHub
A command line utility for listing and searching snapshots in web archives
☆20Jun 4, 2026Updated last month
edsu / whisper-transcript
View on GitHub
A Lit web-component for viewing a Whisper JSON transcript file
☆14Feb 12, 2026Updated 5 months ago
DocNow / awesome-social-media-archiving
View on GitHub
Tools for helping you work with web platform archive downloads.
☆18Mar 27, 2020Updated 6 years ago
vphill / web-archiving-course
View on GitHub
Web Archiving Course
☆23Mar 4, 2024Updated 2 years ago
webrecorder / warcit
View on GitHub
Convert Directories, Files and ZIP Files to Web Archives (WARC)
☆99Apr 22, 2025Updated last year
webrecorder / public-web-archives
View on GitHub
A listing of world wide web archives, for humans and machines using Web Archive Manifest (WAM) yaml format
☆55Dec 5, 2022Updated 3 years ago
webrecorder / webrecorder-desktop
View on GitHub
Webrecorder Desktop App!
☆205Feb 16, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
harvard-lil / js-wacz
View on GitHub
JavaScript module and CLI tool for working with web archive data using the WACZ format specification.
☆17Mar 11, 2025Updated last year
webrecorder / browsertrix-behaviors
View on GitHub
Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.
☆58Updated this week
NationalLibraryOfNorway / warchaeology
View on GitHub
Command line tool for digging into WARC files
☆50Updated this week
webrecorder / wabac.js
View on GitHub
wabac.js - Web Archive Browsing Augmentation Client
☆127Jul 9, 2026Updated last week
webrecorder / pywb-remote-browsers
View on GitHub
Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives
☆16Jun 10, 2021Updated 5 years ago
N0taN3rd / node-warc
View on GitHub
Parse And Create Web ARChive (WARC) files with node.js
☆105Jan 29, 2025Updated last year
WASAPI-Community / data-transfer-apis
View on GitHub
WASAPI data transfer APIs
☆50Apr 23, 2022Updated 4 years ago
DocNow / waybackprov
View on GitHub
utility to fetch provenance information from Internet Archive's Wayback Machine
☆15Feb 5, 2026Updated 5 months ago
webrecorder / markdown-to-respec
View on GitHub
A Github Action for turning Markdown into ReSpec HTML
☆16Jun 6, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ikreymer / browsertrix
View on GitHub
(Note: This repository is obsolete, please see the new Browsertrix webrecorder/browsertrix) Browser-Based On-Demand Web Archiving Automat…
☆38Apr 23, 2019Updated 7 years ago
harvard-lil / warcgames
View on GitHub
Hacking challenges to learn web archive security.
☆35Jun 23, 2017Updated 9 years ago
UAlbanyArchives / describingWebArchives
View on GitHub
Automating description for Web Archives in ArchivesSpace using the Archive-It CDX and Partner Data APIs
☆11Aug 10, 2018Updated 7 years ago
webrecorder / warcio
View on GitHub
Streaming WARC/ARC library for fast web archive IO
☆461Jun 10, 2026Updated last month
iipc / warcaroo
View on GitHub
☆18Apr 29, 2026Updated 2 months ago
mjordan / GitBags
View on GitHub
Some ideas on making Bags into Git repositories
☆16Dec 23, 2014Updated 11 years ago
oduwsdl / ORS
View on GitHub
Object Resource Stream and CDXJ Drafts
☆15Nov 28, 2018Updated 7 years ago
oduwsdl / Reconstructive
View on GitHub
A ServiceWorker for client-side reconstruction of composite mementos
☆15Mar 6, 2025Updated last year
peterk / warcworker
View on GitHub
A dockerized, queued high fidelity web archiver based on Squidwarc
☆62Jul 9, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
mannau / boilerpipeR
View on GitHub
Interface to the boilerpipe Java library by Christian Kohlschutter (http://code.google.com/p/boilerpipe/)
☆21May 19, 2021Updated 5 years ago
webrecorder / py-wacz
View on GitHub
☆61Apr 11, 2024Updated 2 years ago
web-archive-group / heritrix-walkthrough
View on GitHub
☆10Jun 10, 2016Updated 10 years ago
DocNow / tweet-archive
View on GitHub
A tool for working with tweet archives.
☆15Jan 1, 2023Updated 3 years ago
utdata / csvkit-nicar
View on GitHub
A hands-on course for NICAR 2020 (and 2018)
☆11Mar 7, 2020Updated 6 years ago
gwu-libraries / TweetSets
View on GitHub
Service for creating Twitter datasets for research and archiving.
☆26Dec 7, 2022Updated 3 years ago
WILDERNESSWIRELESS / WILDERNESS-WIRELESS-I
View on GitHub
Repository for Radical Networks Workshop
☆14Nov 9, 2016Updated 9 years ago