ArchiveTeam / urls-grab
Archiving URLs (outlinks) from a variety of sources.
☆21 Updated this week
Alternatives and similar repositories for urls-grab:
Users who are interested in urls-grab are comparing it to the libraries listed below
- Archiving GitHub ☆9 Updated 4 months ago
- URLTeam's second generation of URL shortener archiving tools ☆75 Updated 3 months ago
- Archiving public telegram messages. ☆12 Updated 3 months ago
- wpull fork with fixes and faster parsing using html5-parser; used by grab-site; should go away when wpull is similarly improved ☆28 Updated 9 months ago
- Archiving all metadata from YouTube (everything except videos themselves due to size) ☆27 Updated 3 months ago
- A command line tool to archive a git repository from GitHub to the Internet Archive. ☆93 Updated 4 years ago
- A script for immunizing a google account for the effects of 13 September which will break some Google Drive Links ☆11 Updated 3 years ago
- A configurable, reusable tracker with dashboard ☆34 Updated last year
- Scrape https://unlistedvideos.com/ ☆14 Updated 3 years ago
- The Seesaw pipeline grab script for the URLTeam (terroroftinytown) project ☆27 Updated 8 months ago
- Official Python package for ArchiveBox, the self-hosted internet archiving solution. ☆13 Updated 6 months ago
- An S3-to-S3 proxy (and more) implementing file-level deduplication and access control. ☆26 Updated last year
- small GPL3+ Android app that allows you to log in to some Duo-protected services with a standard OTP app ☆16 Updated 4 years ago
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication. ☆119 Updated 3 months ago
- Importable Firefox profile for use with i2p. Also usable with Tor Browser on most platforms. ☆14 Updated 5 years ago
- Web browser for embedded systems. ☆16 Updated 2 years ago
- ☆14 Updated 2 years ago
- Home of the official apt/deb package for Ubuntu/Debian-based systems. ☆17 Updated 6 months ago
- A CLI client for Revolt. ☆10 Updated 3 years ago
- Rust program for extracting most URLs from Discord scrapes. Works with Discord History Tracker, discard2, and DiscordChatExporter. ☆20 Updated 3 months ago
- A searchable clone of russianplanes.net, for transparency and ease of identifying planes. ☆10 Updated 2 years ago
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives ☆15 Updated 3 years ago
- Grabbing everything from reddit. ☆60 Updated last year
- Liveweb proxy of the Wayback Machine project ☆44 Updated 3 years ago
- A small tool for brute forcing the access code of a Yubico Yubikey. ☆20 Updated 4 years ago
- trustor (PoC) ☆25 Updated 3 years ago
- A blacklist of IPs that don't like being scanned ☆9 Updated 7 years ago
- simple script to convert web resources to a single warc file ☆21 Updated last year
- Browser agnostic extension that enables pasting into password fields... ☆15 Updated 5 years ago
- Saving all questions and answers from Yahoo! Answers. ☆50 Updated 3 years ago