A Dockerfile for the ArchiveTeam Warrior
☆424 · Mar 19, 2026 · Updated this week
Alternatives and similar repositories for warrior-dockerfile
Users that are interested in warrior-dockerfile are comparing it to the libraries listed below.
- ArchiveBot, an IRC bot for archiving websites ☆408 · Aug 6, 2025 · Updated 7 months ago
- A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, B… ☆390 · May 19, 2025 · Updated 10 months ago
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from A… ☆19 · Nov 25, 2025 · Updated 3 months ago
- Bash script to force the first page of items on the Internet Archive to be the default. ☆15 · Jun 28, 2019 · Updated 6 years ago
- The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns ☆1,561 · May 23, 2025 · Updated 10 months ago
- An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces. ☆57 · Aug 15, 2024 · Updated last year
- Scripts to build and boot warrior virtual machine containing Docker ☆122 · Apr 6, 2025 · Updated 11 months ago
- Use yt-dlp to download video/metadata and upload to the Internet Archive. ☆480 · Mar 15, 2026 · Updated last week
- Archiving all to-be-deleted NSFW tumblr blogs. ☆52 · Dec 23, 2018 · Updated 7 years ago
- A configurable, reusable tracker with dashboard ☆36 · Dec 15, 2023 · Updated 2 years ago
- Archiving Google+. ☆26 · Apr 4, 2019 · Updated 6 years ago
- A specification for tor's ContactInfo field. ☆11 · Mar 9, 2026 · Updated 2 weeks ago
- Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and mor… ☆27,093 · Mar 16, 2026 · Updated last week
- Darwin Foundation (CoreFoundation & PureFoundation) ☆17 · Jul 9, 2018 · Updated 7 years ago
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication. ☆132 · Updated this week
- We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case. ☆92 · Jul 13, 2020 · Updated 5 years ago
- ☆15 · Nov 5, 2018 · Updated 7 years ago
- An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces. ☆15 · Oct 19, 2020 · Updated 5 years ago
- mutant standard style graphics of real trains ☆14 · Aug 19, 2024 · Updated last year
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Arch… ☆19 · Feb 2, 2024 · Updated 2 years ago
- A ServiceWorker for client-side reconstruction of composite mementos ☆16 · Mar 6, 2025 · Updated last year
- Source for the GitHub Wiki / ReadTheDocs documentation for ArchiveBox, the self-hosted internet archiving solution. ☆16 · Mar 16, 2026 · Updated last week
- Docker container for streaming RTL SDR to Broadcastify ☆15 · Sep 19, 2019 · Updated 6 years ago
- An Awesome List for getting started with web archiving ☆19 · Dec 21, 2018 · Updated 7 years ago
- brozzler - distributed browser-based web crawler ☆791 · Mar 18, 2026 · Updated last week
- A web UI for rTorrent, qBittorrent and Transmission with a Node.js backend and React frontend. Migrate to v4: https://github.com/jesec/fl… ☆1,800 · May 30, 2021 · Updated 4 years ago
- Nondestructive warc-in-tar to warc conversion ☆27 · Apr 21, 2013 · Updated 12 years ago
- Chrome extension to "Create WARC files from any webpage" ☆228 · Dec 5, 2025 · Updated 3 months ago
- An Awesome List for getting started with web archiving ☆2,512 · Updated this week
- Systemd services to send notifications on system startup, shutdown and service failure ☆15 · Sep 12, 2024 · Updated last year
- Ignore network module for ZNC 1.0 ☆14 · Oct 23, 2022 · Updated 3 years ago
- InterPlanetary Wayback: A distributed and persistent archive replay system using IPFS