internetarchive/wayback

IA's public Wayback Machine (moved from SourceForge)

☆850

Alternatives and similar repositories for wayback

Users that are interested in wayback are comparing it to the libraries listed below.

iipc / openwayback
The OpenWayback Development
☆522 • Jan 3, 2024 • Updated 2 years ago
webrecorder / pywb
Core Python Web Archiving Toolkit for replay and recording of web archives
☆1,683 • Apr 10, 2026 • Updated 3 months ago
jjjake / internetarchive
A Python and Command-Line Interface to Archive.org
☆1,886 • Updated this week
internetarchive / CDX-Writer
Python script to create CDX index files of WARC data
☆22 • Sep 4, 2025 • Updated 10 months ago
internetarchive / brozzler
brozzler - distributed browser-based web crawler
☆809 • Jul 7, 2026 • Updated 2 weeks ago
internetarchive / wayback-diff
React components to render differences between captures at the Wayback Machine
☆43 • Jul 6, 2026 • Updated 2 weeks ago
hartator / wayback-machine-downloader
Download an entire website from the Wayback Machine.
☆5,909 • Feb 8, 2024 • Updated 2 years ago
helgeho / Web2Warc
An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)
☆26 • Oct 9, 2017 • Updated 8 years ago
internetarchive / heritrix3
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
☆3,283 • Jul 15, 2026 • Updated last week
jsvine / waybackpack
Download the entire Wayback Machine archive for a given URL.
☆3,217 • Apr 21, 2025 • Updated last year
iipc / awesome-web-archiving
An Awesome List for getting started with web archiving
☆2,607 • Apr 27, 2026 • Updated 2 months ago
mekarpeles / math.mx
A comprehensive graph of mathematical domains and topics
☆24 • Jan 8, 2022 • Updated 4 years ago
internetarchive / liveweb
Liveweb proxy of the Wayback Machine project
☆43 • Apr 20, 2021 • Updated 5 years ago
internetarchive / warcprox
WARC writing MITM HTTP/S proxy
☆456 • Jun 17, 2026 • Updated last month
internetarchive / wayback-machine-firefox
Reduce annoying 404 pages by automatically checking for an archived copy in the Wayback Machine. Learn more about this Test Pilot experim…
☆59 • Dec 2, 2018 • Updated 7 years ago
thegroovebox / groovebox.org
Spotify clone for the Internet Archive's Music Library
☆15 • Aug 19, 2020 • Updated 5 years ago
internetarchive / warctools
Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)
☆176 • Aug 18, 2025 • Updated 11 months ago
internetarchive / warc
Python library for reading and writing warc files
☆249 • Mar 7, 2022 • Updated 4 years ago
JustAnotherArchivist / archivebot-archives
☆15 • Nov 5, 2018 • Updated 7 years ago
Rhizome-Conifer / conifer
Collect and revisit web pages.
☆1,542 • Updated this week
ArchiveLabs / api.archivelab.org
Archive.org API Server
☆39 • Nov 1, 2023 • Updated 2 years ago
internetarchive / surt
Sort-friendly URI Reordering Transform (SURT) python module
☆45 • Sep 11, 2025 • Updated 10 months ago
ArchiveTeam / grab-site
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
☆1,602 • May 23, 2025 • Updated last year
ArchiveTeam / ArchiveBot
ArchiveBot, an IRC bot for archiving websites
☆419 • Apr 17, 2026 • Updated 3 months ago
machawk1 / wail
Web Archiving Integration Layer: One-Click User Instigated Preservation
☆398 • Jun 19, 2026 • Updated last month
internetarchive / sandcrawler
Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki
☆28 • Jul 31, 2024 • Updated last year
ArchiveTeam / wpull
Wget-compatible web downloader and crawler.
☆613 • Apr 29, 2024 • Updated 2 years ago
oduwsdl / MemGator
A Memento Aggregator CLI and Server in Go
☆80 • Apr 9, 2026 • Updated 3 months ago
sangaline / wayback-machine-scraper
A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.
☆480 • Feb 23, 2024 • Updated 2 years ago
internetarchive / umbra
A queue-controlled browser automation tool for improving web crawl quality
☆68 • May 28, 2026 • Updated last month
vinaygoel / ars-workshop
Archive Research Services Workshop
☆31 • Sep 29, 2017 • Updated 8 years ago
hrbrmstr / wayback
Tools to Work with the Various Internet Archive Wayback Machine APIs
☆55 • Sep 18, 2018 • Updated 7 years ago
internetarchive / bookreader
The Internet Archive BookReader
☆1,160 • Updated this week
webrecorder / wombat
Wombat.js client-side rewriting library
☆122 • Updated this week
ArchiveTeam / youtube-grab
Archiving all metadata from YouTube (everything except videos themselves due to size)
☆34 • Jul 17, 2026 • Updated last week
WASAPI-Community / data-transfer-apis
WASAPI data transfer APIs
☆50 • Apr 23, 2022 • Updated 4 years ago
internetarchive / wayback-machine-webextension
A web browser extension for Chrome, Firefox, Edge, and Safari 14.
☆843 • Jun 21, 2026 • Updated last month
ArchiveTeam / urls-grab
Archiving URLs (outlinks) from a variety of sources.
☆25 • Jun 26, 2026 • Updated 3 weeks ago
iipc / warc-specifications
Centralised repository for WARC usage specifications.
☆129 • Apr 4, 2026 • Updated 3 months ago