Famicoman / ia-collection-dl
Downloads an entire Internet Archive collection
☆33Updated 6 years ago
Alternatives and similar repositories for ia-collection-dl
Users that are interested in ia-collection-dl are comparing it to the libraries listed below
- Tool and library for handling Web ARChive (WARC) files.☆164Updated 11 months ago
- NOTE: This project is no longer being actively developed. Check out https://replayweb.page / https://github.com/webrecorder/replayweb.pa…☆200Updated 8 months ago
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)☆165Updated last month
- Estimate website size.☆84Updated last year
- Grabbing all news.☆62Updated 5 years ago
- Documentation for the Internet Archive S3 API☆75Updated 7 years ago
- Webrecorder Player for Desktop (OSX/Windows/Linux). (Built with Electron + Webrecorder)☆448Updated 5 years ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC)☆87Updated 5 months ago
- Basic Python script to list following and followed blogs on Tumblr☆20Updated 10 years ago
- Uploads items into the Internet Archive after they have been downloaded with youtube-dl☆15Updated 10 years ago
- Web Archiving Integration Layer: One-Click User Instigated Preservation☆380Updated 6 months ago
- Wget-compatible web downloader and crawler.☆593Updated last year
- Modular workflow assistant for book digitization☆20Updated 9 years ago
- A collection of tools for archiving and analysing the internet.☆78Updated 3 years ago
- 💾 YouTube video metadata archiver written in Golang☆20Updated 5 years ago
- Nondestructive warc-in-tar to warc conversion☆27Updated 12 years ago
- Recover lost websites from the Web Infrastructure☆89Updated last month
- Easily archive important Reddit post threads onto your computer☆63Updated 3 years ago
- Boot scripts for the ArchiveTeam Warrior 2☆26Updated 2 months ago
- Serving content from a WARC☆62Updated 12 years ago
- Scrapes and archives a Yahoo Groups email archive, photo galleries and file contents using the non-public API☆94Updated 5 years ago
- A Python script for saving your Tumblr blog to your hard drive as HTML or CSV.☆88Updated 8 years ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.☆47Updated 7 years ago
- Making a reusable toolkit for writing seesaw scripts☆72Updated 2 years ago
- We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case.☆92Updated 5 years ago
- Distributed crawler, database and web frontend for public directories indexing☆141Updated 5 years ago
- Wget with Lua extension☆24Updated 9 years ago
- One-Click User Instigated Preservation☆128Updated 6 years ago
- A dockerized, queued high fidelity web archiver based on Squidwarc☆61Updated last year
- Plowshare module for mega.co.nz☆25Updated 6 years ago