Wayback Machine API interface & a command-line tool
☆572Feb 26, 2024Updated 2 years ago
Alternatives and similar repositories for waybackpy
Users that are interested in waybackpy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Python API to the Internet Archive Wayback Machine☆86Apr 9, 2026Updated last week
- A Tool To Push Web Resources Into Web Archives☆432Jan 23, 2024Updated 2 years ago
- A Python script to submit web pages to the Wayback Machine for archiving.☆85Jan 23, 2026Updated 2 months ago
- Homebrew formula for the ArchiveBox self-hosted internet archiving solution.☆28Oct 5, 2024Updated last year
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service☆192Mar 30, 2026Updated 2 weeks ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Core Python Web Archiving Toolkit for replay and recording of web archives☆1,643Apr 10, 2026Updated last week
- Ruby gem to send URLs to Wayback Machine☆61Updated this week
- Command line tool for digging into WARC files☆51Apr 10, 2026Updated last week
- A toolkit makes it easier to archive webpages to IPFS☆13Jul 31, 2023Updated 2 years ago
- A Python and Command-Line Interface to Archive.org☆1,850Updated this week
- ☆17Mar 31, 2025Updated last year
- A prototype server to swarm multiple DATs for Webrecorder☆14Apr 27, 2019Updated 6 years ago
- A command line utility for listing and searching snapshots in web archives☆17Dec 21, 2023Updated 2 years ago
- An Awesome List for getting started with web archiving☆2,520Mar 18, 2026Updated last month
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Download the entire Wayback Machine archive for a given URL.☆3,178Apr 21, 2025Updated 11 months ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆57Apr 7, 2026Updated last week
- Web archiving using Google Chrome☆45Dec 30, 2019Updated 6 years ago
- Specifications developed and maintained by the Webrecorder community.☆141Oct 16, 2025Updated 6 months ago
- A PDF classifier ensemble with REST API service☆23Mar 5, 2021Updated 5 years ago
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki☆28Jul 31, 2024Updated last year
- OSINT tool to download archived PDF files from archive.org for a given website.☆54Jun 20, 2020Updated 5 years ago
- The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns☆1,564May 23, 2025Updated 10 months ago
- Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now☆141Apr 3, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Archiveror will help you preserve the webpages you love. 💾☆457Oct 18, 2019Updated 6 years ago
- Run a high-fidelity browser-based web archiving crawler in a single Docker container☆1,020Updated this week
- A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.☆475Feb 23, 2024Updated 2 years ago
- Template for new OSINT command-line tools☆75Nov 25, 2024Updated last year
- ☁️ Curated Cloud OSINT resources — dorks, tools, and techniques for AWS, Azure, GCP, Oracle Cloud, and other major providers reconnaissan…☆119Apr 8, 2026Updated last week
- Script to automate, when possible, the passive reconnaissance performed on a website prior to an assessment.☆39Updated this week
- A simple Python wrapper for the archive.is capturing service☆217Feb 11, 2025Updated last year
- Home of the official docker image for ArchiveBox☆57Dec 18, 2024Updated last year
- Python framework for manipulating bulk WHOIS data from RIRs☆22Jan 8, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Archived tweets from the Wayback Machine☆188May 26, 2025Updated 10 months ago
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆57Aug 15, 2024Updated last year
- Streaming WARC/ARC library for fast web archive IO☆453Apr 6, 2026Updated last week
- Web archive index server based on RocksDB☆38Apr 1, 2026Updated 2 weeks ago
- Converts WARC files to static HTML☆52Sep 18, 2025Updated 7 months ago
- 🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and mor…☆27,248Updated this week
- An archiving tool with an IM-style interface that prioritizes privacy and accessibility, integrated with various archival services includ…☆2,174Apr 12, 2026Updated last week