ArchiveTeam / seesaw-kit
Making a reusable toolkit for writing seesaw scripts
☆70 · Updated last year
Alternatives and similar repositories for seesaw-kit:
Users who are interested in seesaw-kit are comparing it to the repositories listed below.
- Boot scripts for the ArchiveTeam Warrior 2 ☆25 · Updated 6 years ago
- URLTeam's second generation of URL shortener archiving tools ☆75 · Updated last month
- We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case. ☆88 · Updated 4 years ago
- wpull fork with fixes and faster parsing using html5-parser; used by grab-site; should go away when wpull is similarly improved ☆27 · Updated 7 months ago
- The Seesaw pipeline grab script for the URLTeam (terroroftinytown) project ☆27 · Updated 6 months ago
- Saves proxied HTTP traffic to a WARC file. ☆27 · Updated 11 years ago
- Convert HTTP Archive (HAR) -> Web Archive (WARC) format ☆51 · Updated 6 years ago
- An evil web server. ☆13 · Updated 9 years ago
- An HTTP-based warc-to-zip converter ☆11 · Updated 11 years ago
- A command line tool to archive a git repository from GitHub to the Internet Archive. ☆92 · Updated 4 years ago
- Saving all questions and answers from Yahoo! Answers. ☆50 · Updated 3 years ago
- Take advantage of Flickr's new terabyte storage limit by turning it into a bad network filesystem with FUSE ☆82 · Updated 11 years ago
- Web archiving using Google Chrome ☆44 · Updated 5 years ago
- Witches Town extended extended informations ☆12 · Updated 6 years ago
- Archiving Google+. ☆24 · Updated 5 years ago
- Photobucket image and album extractor. Improved version of PB_Shovel by Daxda, to scrape urls. ☆11 · Updated 5 years ago
- Reduce annoying 404 pages by automatically checking for an archived copy in the Wayback Machine. Learn more about this Test Pilot experim… ☆56 · Updated 6 years ago
- The STiki anti-damage tool for wikis/Wikipedia ☆22 · Updated 6 years ago
- This is why nobody ever encrypts anything ☆30 · Updated 7 years ago
- Archiving all to-be-deleted NSFW tumblr blogs. ☆49 · Updated 6 years ago
- Hyperboria (CJDNS network) map ☆63 · Updated 5 years ago
- Documentation for the Internet Archive S3 API ☆72 · Updated 7 years ago
- 🗄 Bot powering the @LinkArchiver Twitter tool to send tweeted URLs to the Wayback Machine ☆46 · Updated 7 years ago
- Quick and flexible irc bot, extensible in any language ☆52 · Updated 9 months ago
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication. ☆115 · Updated last month
- Decentralized web Gateway for Internet Archive ☆21 · Updated 5 years ago
- Fuuka Imageboard Archiver ☆57 · Updated 5 years ago
- Dark, minimalist, type-focused theme for The Lounge. ☆15 · Updated 7 years ago
- An investigation into Jacob Appelbaum leaving the Tor Project ☆96 · Updated 9 months ago
- A simple spec for hostname-safe HTTP content addressing ☆39 · Updated 9 years ago