Famicoman / ia-collection-dl
Downloads an entire Internet Archive collection
☆31Updated 5 years ago
Alternatives and similar repositories for ia-collection-dl:
Users that are interested in ia-collection-dl are comparing it to the libraries listed below
- Tool and library for handling Web ARChive (WARC) files.☆153Updated 3 months ago
- Basic python script to list following and followed blogs on Tumblr☆19Updated 10 years ago
- A GUI based gopher (protocol) client☆22Updated 5 years ago
- Modular workflow assistant for book digitization☆18Updated 8 years ago
- dosage is a comic strip downloader and archiver☆51Updated 5 years ago
- Documentation for the Internet Archive S3 API☆72Updated 6 years ago
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)☆153Updated 4 years ago
- Photobucket image and album extractor. Improved version of PB_Shovel by Daxda, to scrape urls.☆11Updated 5 years ago
- Estimate website size.☆84Updated 11 months ago
- Wget with Lua extension☆23Updated 9 years ago
- Itabashi (板橋) is a bridging bot that syncs messages between a Discord and an IRC channel.☆19Updated 6 years ago
- Serving content from a WARC☆61Updated 12 years ago
- A Python script for saving your Tumblr blog to your hard drive as HTML or CSV.☆89Updated 7 years ago
- Bash script to force the first page of items on the Internet Archive to be the default.☆15Updated 5 years ago
- a list of disposable and temporary email address domains☆8Updated 6 years ago
- A dockerized, queued high fidelity web archiver based on Squidwarc☆56Updated 6 months ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC)☆83Updated last month
- Easily archive important Reddit post threads onto your computer☆57Updated 2 years ago
- Nondestructive warc-in-tar to warc conversion☆26Updated 11 years ago
- Google Books Downloader / Image Scraper☆53Updated 5 years ago
- Recover lost websites from the Web Infrastructure☆87Updated 3 years ago
- We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case.☆88Updated 4 years ago
- Issuu scraper written in Python.☆16Updated 5 years ago
- Converts a Yahoo group archive created by yahoo-group-archiver into standalone email, mbox folders, and PDF files☆22Updated 3 years ago
- A command line tool that reads a HyperCard stack and generates a folder with XML and PBM files from it containing a more easily readable …☆41Updated 2 years ago
- A command line tool to archive a git repository from GitHub to the Internet Archive.☆91Updated 3 years ago
- It's all about setting limits for yourself.☆15Updated 11 years ago
- Converts WARC files to static HTML☆43Updated 6 months ago
- Plowshare module for mega.co.nz☆24Updated 5 years ago