ArchiveTeam/ArchiveBot

ArchiveBot, an IRC bot for archiving websites

☆418

Alternatives and similar repositories for ArchiveBot

Users who are interested in ArchiveBot are comparing it to the repositories listed below.

ArchiveTeam / wpull
Wget-compatible web downloader and crawler.
☆612 · Apr 29, 2024 · Updated 2 years ago
ArchiveTeam / warrior-code2
Boot scripts for the ArchiveTeam Warrior 2
☆26 · Jul 5, 2025 · Updated last year
ArchiveTeam / grab-site
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
☆1,601 · May 23, 2025 · Updated last year
ArchiveTeam / NewsGrabber
Grabbing all news.
☆60 · Dec 23, 2019 · Updated 6 years ago
ArchiveTeam / seesaw-kit
Making a reusable toolkit for writing seesaw scripts
☆75 · Updated this week
ArchiveTeam / tumblr-grab
Archiving all to-be-deleted NSFW tumblr blogs.
☆53 · Dec 23, 2018 · Updated 7 years ago
ArchiveTeam / terroroftinytown-client-grab
The Seesaw pipeline grab script for the URLTeam (terroroftinytown) project
☆28 · Jul 17, 2025 · Updated last year
ArchiveTeam / terroroftinytown
URLTeam's second generation of URL shortener archiving tools
☆81 · Mar 12, 2026 · Updated 4 months ago
ArchiveTeam / reddit-grab
Grabbing everything from reddit.
☆62 · Feb 16, 2024 · Updated 2 years ago
ArchiveTeam / warrior-dockerfile
A Dockerfile for the ArchiveTeam Warrior
☆447 · Updated this week
ikreymer / webarchiveplayer
NOTE: This project is no longer being actively developed. Check out https://replayweb.page / https://github.com/webrecorder/replayweb.pa…
☆203 · Jan 22, 2025 · Updated last year
ArchiveTeam / universal-tracker
A configurable, reusable tracker with dashboard
☆36 · Dec 15, 2023 · Updated 2 years ago
PromyLOPh / crocoite
Web archiving using Google Chrome
☆45 · Dec 30, 2019 · Updated 6 years ago
iipc / openwayback
The OpenWayback Development
☆522 · Jan 3, 2024 · Updated 2 years ago
ArchiveTeam / IA.BAK
We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case.
☆93 · Jul 13, 2020 · Updated 6 years ago
bibanon / tubeup
Use yt-dlp to download video/metadata and upload to the Internet Archive.
☆509 · May 8, 2026 · Updated 2 months ago
recrm / ArchiveTools
A collection of tools for archiving and analysing the internet.
☆79 · Jul 6, 2022 · Updated 4 years ago
ArchiveTeam / urls-grab
Archiving URLs (outlinks) from a variety of sources.
☆25 · Jun 26, 2026 · Updated 3 weeks ago
internetarchive / brozzler
brozzler - distributed browser-based web crawler
☆809 · Jul 7, 2026 · Updated 2 weeks ago
ArchiveTeam / github-grab
Archiving GitHub
☆11 · Aug 5, 2025 · Updated 11 months ago
internetarchive / warcprox
WARC writing MITM HTTP/S proxy
☆456 · Jun 17, 2026 · Updated last month
webrecorder / cdxj-indexer
CDXJ Indexing of WARC/ARCs
☆35 · May 11, 2026 · Updated 2 months ago
internetarchive / warc
Python library for reading and writing warc files
☆249 · Mar 7, 2022 · Updated 4 years ago
alard / megawarc
Nondestructive warc-in-tar to warc conversion
☆27 · Apr 21, 2013 · Updated 13 years ago
jjjake / internetarchive
A Python and Command-Line Interface to Archive.org
☆1,884 · Updated this week
oduwsdl / archivenow
A Tool To Push Web Resources Into Web Archives
☆434 · Jan 23, 2024 · Updated 2 years ago
palewire / archiveis
A simple Python wrapper for the archive.is capturing service
☆219 · Feb 11, 2025 · Updated last year
iipc / awesome-web-archiving
An Awesome List for getting started with web archiving
☆2,605 · Apr 27, 2026 · Updated 2 months ago
webrecorder / pywb
Core Python Web Archiving Toolkit for replay and recording of web archives
☆1,682 · Apr 10, 2026 · Updated 3 months ago
ArchiveTeam / wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
☆137 · Mar 19, 2026 · Updated 4 months ago
palewire / savepagenow
A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service
☆195 · Jun 17, 2026 · Updated last month
oduwsdl / MemGator
A Memento Aggregator CLI and Server in Go
☆80 · Apr 9, 2026 · Updated 3 months ago
tempname1024 / allium
Repository moved to https://git.jordan.im/allium/
☆19 · Jan 26, 2023 · Updated 3 years ago
Famicoman / ia-ul-from-youtubedl
Uploads items into the Internet Archive after they have been downloaded with youtube-dl
☆15 · Feb 28, 2015 · Updated 11 years ago
ArchiveTeam / yahooanswers-grab
Saving all questions and answers from Yahoo! Answers.
☆50 · May 4, 2021 · Updated 5 years ago
webrecorder / browsertrix-crawler
Run a high-fidelity browser-based web archiving crawler in a single Docker container
☆1,088 · Updated this week
iipc / urlcanon
url canonicalization library for python and java
☆43 · May 22, 2022 · Updated 4 years ago
web-archive-group / heritrix-walkthrough
☆10 · Jun 10, 2016 · Updated 10 years ago
odie5533 / WarcMiddleware
WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.
☆48 · Mar 19, 2018 · Updated 8 years ago