Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
☆138Mar 19, 2026Updated 2 months ago
Alternatives and similar repositories for wget-lua
Users that are interested in wget-lua are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- wpull fork with fixes and faster parsing using html5-parser; used by grab-site; should go away when wpull is similarly improved☆31Sep 20, 2025Updated 8 months ago
- A tool for archiving DokuWiki☆28Apr 2, 2026Updated 2 months ago
- Sources for urls-grab.☆13Updated this week
- 🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en…☆20Jul 11, 2025Updated 10 months ago
- Wget-compatible web downloader and crawler.☆609Apr 29, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns☆1,572May 23, 2025Updated last year
- Wget with Lua extension☆24Dec 17, 2015Updated 10 years ago
- Command line tool for digging into WARC files☆49Updated this week
- Run a high-fidelity browser-based web archiving crawler in a single Docker container☆1,047Updated this week
- Python web interface to rTorrent.☆26Feb 10, 2021Updated 5 years ago
- Specifications developed and maintained by the Webrecorder community.☆140Oct 16, 2025Updated 7 months ago
- The Seesaw pipeline grab script for the URLTeam (terroroftinytown) project☆28Jul 17, 2025Updated 10 months ago
- Nondestructive warc-in-tar to warc conversion☆27Apr 21, 2013Updated 13 years ago
- Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now☆143Apr 3, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Read and write WARC files in Go☆49Updated this week
- Web archive index server based on RocksDB☆43Updated this week
- Web site for the Squash Compression Benchmark☆18Feb 16, 2018Updated 8 years ago
- A Tool To Push Web Resources Into Web Archives☆434Jan 23, 2024Updated 2 years ago
- Create and edit WARC and WACZ files☆29Dec 6, 2024Updated last year
- Archiving URLs (outlinks) from a variety of sources.☆25May 22, 2026Updated 2 weeks ago
- A configurable, reusable tracker with dashboard☆36Dec 15, 2023Updated 2 years ago
- ☆17Apr 16, 2026Updated last month
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆58Jun 2, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Docker Container for grab-site☆13Aug 26, 2024Updated last year
- Sublime Text API Version Documenter☆11Jan 3, 2023Updated 3 years ago
- External link tracking tool for Wikimedia partnerships☆11Oct 3, 2025Updated 8 months ago
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives☆16Jun 10, 2021Updated 5 years ago
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.☆12Oct 5, 2024Updated last year
- ☆59Apr 11, 2024Updated 2 years ago
- WARC writing MITM HTTP/S proxy☆453Jun 3, 2026Updated last week
- Webrecorders DevTools Protocol Automation Library☆18Oct 18, 2022Updated 3 years ago
- brozzler - distributed browser-based web crawler☆799May 19, 2026Updated 3 weeks ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Command line tool to convert a file in the WARC format to a file in the ZIM format☆85Mar 30, 2026Updated 2 months ago
- A S3 hybrid storage interface for dat and hyperdrive☆13Jul 31, 2018Updated 7 years ago
- Streaming WARC/ARC library for fast web archive IO☆458Apr 6, 2026Updated 2 months ago
- Utilities for Arch Linux development, in a flake☆14Apr 20, 2026Updated last month
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆15Oct 19, 2020Updated 5 years ago
- ArchiveBot, an IRC bot for archiving websites☆415Apr 17, 2026Updated last month
- Homographs: brutefind homographs within a font☆19Apr 21, 2017Updated 9 years ago