A Dockerfile for the ArchiveTeam Warrior
☆421Feb 11, 2026Updated 2 weeks ago
Alternatives and similar repositories for warrior-dockerfile
Users that are interested in warrior-dockerfile are comparing it to the libraries listed below
Sorting:
- ArchiveBot, an IRC bot for archiving websites☆406Aug 6, 2025Updated 6 months ago
- Making a reusable toolkit for writing seesaw scripts☆74Jan 21, 2026Updated last month
- Saving all questions and answers from Yahoo! Answers.☆50May 4, 2021Updated 4 years ago
- 🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en…☆19Jul 11, 2025Updated 7 months ago
- 😇 A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, B…☆386May 19, 2025Updated 9 months ago
- Archiving parts of the US government.☆29Dec 18, 2025Updated 2 months ago
- The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns☆1,554May 23, 2025Updated 9 months ago
- Wget-compatible web downloader and crawler.☆600Apr 29, 2024Updated last year
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from A…☆19Nov 25, 2025Updated 3 months ago
- Archiving all to-be-deleted NSFW tumblr blogs.☆52Dec 23, 2018Updated 7 years ago
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆58Aug 15, 2024Updated last year
- Use yt-dlp to download video/metadata and upload to the Internet Archive.☆479Feb 20, 2026Updated last week
- Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.☆16Aug 1, 2025Updated 7 months ago
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆15Oct 19, 2020Updated 5 years ago
- A configurable, reusable tracker with dashboard☆36Dec 15, 2023Updated 2 years ago
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.☆132Feb 19, 2026Updated last week
- Scripts to build and boot warrior virtual machine containing Docker☆122Apr 6, 2025Updated 10 months ago
- We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case.☆92Jul 13, 2020Updated 5 years ago
- Darwin Foundation (CoreFoundation & PureFoundation)☆17Jul 9, 2018Updated 7 years ago
- Read and write WARC files in Go☆49Feb 13, 2026Updated 2 weeks ago
- UI front-end for YTP+☆12Jul 24, 2019Updated 6 years ago
- Archiving Google+.☆26Apr 4, 2019Updated 6 years ago
- A Python and Command-Line Interface to Archive.org☆1,839Feb 24, 2026Updated last week
- WARC writing MITM HTTP/S proxy☆445Feb 3, 2026Updated last month
- Chrome extension to "Create WARC files from any webpage"☆228Dec 5, 2025Updated 2 months ago
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Arch…☆19Feb 2, 2024Updated 2 years ago
- My first stab at a Minecraft-enabled firmware for the ESP8266☆29Aug 11, 2020Updated 5 years ago
- A web UI for rTorrent, qBittorrent and Transmission with a Node.js backend and React frontend. Migrate to v4: https://github.com/jesec/fl…☆1,802May 30, 2021Updated 4 years ago
- Nondestructive warc-in-tar to warc conversion☆27Apr 21, 2013Updated 12 years ago
- brozzler - distributed browser-based web crawler☆788Feb 24, 2026Updated last week
- Python library for reading and writing warc files☆248Mar 7, 2022Updated 3 years ago
- NearTalk is chat platform to talk to people nearby.☆23Dec 20, 2023Updated 2 years ago
- An Awesome List for getting started with web archiving☆2,489Jan 19, 2026Updated last month
- Hexley is the mascot for Apple's open source operating system Darwin. Jon Hooper created the design which was then named after Darwin's a…☆25Jul 17, 2015Updated 10 years ago
- Archiving GitHub☆11Aug 5, 2025Updated 6 months ago
- Use linux on a laptop and use xrandr to use projectors or external monitors? This is just a simple script that takes no arguments and doe…☆16Dec 15, 2023Updated 2 years ago
- Pattern weaving sequencer for norns.☆10Jan 9, 2021Updated 5 years ago
- Manage your own Git server from the command line☆14May 18, 2018Updated 7 years ago
- 🖥️ Custom Flask + Jinja2 static site generator and content powering Monadical.com☆11Updated this week