Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
☆133 · Mar 19, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for wget-lua
Users who are interested in wget-lua are comparing it to the repositories listed below.
- wpull fork with fixes and faster parsing using html5-parser; used by grab-site; should go away when wpull is similarly improved ☆30 · Sep 20, 2025 · Updated 6 months ago
- Making a reusable toolkit for writing seesaw scripts ☆74 · Mar 13, 2026 · Updated 3 weeks ago
- Sources for urls-grab. ☆13 · Apr 4, 2026 · Updated last week
- Wget-compatible web downloader and crawler. ☆604 · Apr 29, 2024 · Updated last year
- Awesome list dedicated to digital and data preservation tools, sources, services and so on. ☆35 · Mar 9, 2026 · Updated last month
- The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns ☆1,558 · May 23, 2025 · Updated 10 months ago
- Wget with Lua extension ☆24 · Dec 17, 2015 · Updated 10 years ago
- Command line tool for digging into WARC files ☆51 · Mar 31, 2026 · Updated last week
- Run a high-fidelity browser-based web archiving crawler in a single Docker container ☆1,016 · Updated this week
- Specifications developed and maintained by the Webrecorder community. ☆140 · Oct 16, 2025 · Updated 5 months ago
- The Seesaw pipeline grab script for the URLTeam (terroroftinytown) project ☆28 · Jul 17, 2025 · Updated 8 months ago
- Nondestructive warc-in-tar to warc conversion ☆27 · Apr 21, 2013 · Updated 12 years ago
- Browser agnostic extension that enables pasting into password fields... ☆15 · Jan 12, 2020 · Updated 6 years ago
- Centralised repository for WARC usage specifications. ☆125 · Apr 4, 2026 · Updated last week
- A Tool To Push Web Resources Into Web Archives ☆432 · Jan 23, 2024 · Updated 2 years ago
- Create and edit WARC and WACZ files ☆25 · Dec 6, 2024 · Updated last year
- Archiving URLs (outlinks) from a variety of sources. ☆25 · Mar 27, 2026 · Updated 2 weeks ago
- Counting the collisions with perl hash tables per function ☆12 · Jun 5, 2019 · Updated 6 years ago
- A configurable, reusable tracker with dashboard ☆36 · Dec 15, 2023 · Updated 2 years ago
- novel high throughput entropy encoder for BWT data ☆15 · Aug 10, 2022 · Updated 3 years ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler. ☆56 · Feb 10, 2026 · Updated 2 months ago
- Docker Container for grab-site ☆13 · Aug 26, 2024 · Updated last year
- External link tracking tool for Wikimedia partnerships ☆11 · Oct 3, 2025 · Updated 6 months ago
- Official Python package for ArchiveBox, the self-hosted internet archiving solution. ☆12 · Oct 5, 2024 · Updated last year
- 🚀 Generate & manage free Cloudflare WARP configs for WireGuard, Sing-Box & AmneziaWG. Your personal, fast, and secure VPN solution. ☆30 · Nov 15, 2025 · Updated 4 months ago
- WARC writing MITM HTTP/S proxy ☆448 · Apr 1, 2026 · Updated last week
- brozzler - distributed browser-based web crawler ☆795 · Mar 26, 2026 · Updated 2 weeks ago
- Command line tool to convert a file in the WARC format to a file in the ZIM format ☆82 · Mar 30, 2026 · Updated last week
- ☆28 · Feb 13, 2026 · Updated last month
- Streaming WARC/ARC library for fast web archive IO ☆452 · Updated this week
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces. ☆15 · Oct 19, 2020 · Updated 5 years ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC) ☆97 · Apr 22, 2025 · Updated 11 months ago
- ArchiveBot, an IRC bot for archiving websites ☆408 · Aug 6, 2025 · Updated 8 months ago
- Homographs: brutefind homographs within a font ☆19 · Apr 21, 2017 · Updated 8 years ago
- 🖥️ Custom Flask + Jinja2 static site generator and content powering Monadical.com ☆10 · Mar 10, 2026 · Updated last month
- Dictionary compressor with nibbled ANS and optimal parsing. Other compression experiments. ☆26 · Apr 13, 2025 · Updated 11 months ago
- A set of Docker images for streaming a remote desktop video and audio ☆27 · May 15, 2023 · Updated 2 years ago
- (Experimental) High-fidelity capture of Twitter threads as sealed PDFs. ☆55 · Dec 4, 2023 · Updated 2 years ago
- The apihandyman.io website ☆13 · Oct 12, 2024 · Updated last year