akamhy / waybackpy
Wayback Machine API interface & a command-line tool
☆544 Updated last year
Alternatives and similar repositories for waybackpy
Users who are interested in waybackpy are comparing it to the libraries listed below.
- A Python API to the Internet Archive Wayback Machine ☆76 Updated last week
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service ☆181 Updated 10 months ago
- A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine. ☆453 Updated last year
- A quick way to gather all the metadata about a video, playlist, or channel from the YouTube API. ☆435 Updated 5 months ago
- Archived tweets from the Wayback Machine ☆129 Updated 2 months ago
- Run a high-fidelity browser-based web archiving crawler in a single Docker container ☆853 Updated this week
- Automatically archive links to videos, images, and social media content from Google Sheets (and more). ☆872 Updated last week
- Yet another googlesearch - A Python library for executing intelligent, realistic-looking, and tunable Google searches. ☆283 Updated last year
- Estimating the age of web resources ☆96 Updated 2 months ago
- Tool for extracting comments or subtitles from YouTube videos ☆145 Updated 3 years ago
- Automate downloading archived deleted Tweets. ☆186 Updated 2 years ago
- Web-tool to search YouTube for geographically tagged videos by channel, topic, and location. Videos are viewable in a map and exportabled… ☆144 Updated 7 months ago
- A Scrapy middleware for scraping time series data from Archive.org's Wayback Machine. ☆116 Updated last year
- A helper library full of URL-related heuristics. ☆70 Updated 2 months ago
- Tool for the retrieval of corporate and financial data from the SEC ☆181 Updated 3 months ago
- Node module to generate likely aliases for a given human name ☆22 Updated 2 years ago
- A webmining CLI tool & library for Python. ☆333 Updated 2 months ago
- Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more … ☆313 Updated last week
- Provides tools to analyze hashtags within posts scraped from TikTok. ☆345 Updated last year
- Scrape VK URLs to fetch info and media - python API or command line tool. ☆52 Updated 3 months ago
- A definitive guide to generating usernames for OSINT purposes ☆164 Updated last year
- Extract web archive data using Wayback Machine and Common Crawl ☆159 Updated 9 months ago
- An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape Tweets and more whil… ☆187 Updated 2 years ago
- Easy to deploy API for transcribing and translating audio / video using OpenAI's whisper model. ☆70 Updated last year
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.… ☆25 Updated 4 years ago
- The Internet Archive Research Assistant - Searches the Internet Archive daily for new items matching your keywords ☆75 Updated 2 months ago
- Multithreading requests via TOR with automatic TOR new identity ☆59 Updated 2 years ago
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents) ☆162 Updated this week
- Tool and library for handling Web ARChive (WARC) files. ☆163 Updated 10 months ago
- Want to contribute? These are difficult, long-term projects that could be valuable to open source investigators at Bellingcat and around … ☆354 Updated last year