akamhy / waybackpy
Wayback Machine API interface & a command-line tool
☆505 Updated 11 months ago
Alternatives and similar repositories for waybackpy:
Users who are interested in waybackpy are comparing it to the libraries listed below.
- A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine. ☆431 Updated 11 months ago
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service ☆174 Updated 4 months ago
- Run a high-fidelity browser-based web archiving crawler in a single Docker container ☆712 Updated this week
- Retrieves archived tweets from Wayback Machine in HTML, CSV, and JSON ☆95 Updated last week
- A quick way to gather all the metadata about a video, playlist, or channel from the YouTube API. ☆398 Updated 2 weeks ago
- Automatically archive links to videos, images, and social media content from Google Sheets (and more). ☆602 Updated this week
- A Scrapy middleware for scraping time series data from Archive.org's Wayback Machine. ☆112 Updated last year
- A Tool To Push Web Resources Into Web Archives ☆416 Updated last year
- Yet another googlesearch - A Python library for executing intelligent, realistic-looking, and tunable Google searches. ☆269 Updated 10 months ago
- These tweets display several bad actors' most divisive uses of the Twitter platform. ☆50 Updated 2 years ago
- Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now ☆117 Updated last week
- A definitive guide to generating usernames for OSINT purposes ☆158 Updated 8 months ago
- Extract web archive data using Wayback Machine and Common Crawl ☆151 Updated 3 months ago
- The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns ☆1,449 Updated 7 months ago
- Automate downloading archived deleted Tweets. ☆183 Updated last year
- memory.lol ☆577 Updated last month
- URLTeam's second generation of URL shortener archiving tools ☆75 Updated last month
- This tool downloads each page from the Wayback Machine for a specific domain and enables further keyword search on each saved page. ☆169 Updated 9 months ago
- A Python and Command-Line Interface to Archive.org ☆1,668 Updated this week
- Tool for the retrieval of corporate and financial data from the SEC ☆150 Updated last month
- Estimating the age of web resources ☆93 Updated last year
- 🌐 List of free and downloadable top 1M domain list (alexa alternatives) 📊 ☆169 Updated 6 months ago
- 🍨 High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web. ☆148 Updated 2 weeks ago
- Scrape VK URLs to fetch info and media - python API or command line tool. ☆49 Updated last month
- Visualise networks of companies, officers and addresses connected through UK Companies House ☆60 Updated 3 months ago
- Uncover the full name of a target on Linkedin. ☆158 Updated 2 years ago
- A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machine ☆166 Updated last month
- ☆166 Updated this week
- Python library and command line tool for collecting JSON data from Gab.ai. Scrape posts, users and comments from "free-speech" social med… ☆36 Updated 2 years ago
- Advanced Search for Twitter. ☆1,328 Updated last year