ArchiveTeam/wget-lua

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

☆137

Alternatives and similar repositories for wget-lua

Users who are interested in wget-lua are comparing it to the repositories listed below.

ArchiveTeam / seesaw-kit
Making a reusable toolkit for writing seesaw scripts
☆75 · Updated this week
ArchiveTeam / urls-sources
Sources for urls-grab.
☆14 · Jun 20, 2026 · Updated last month
ArchiveTeam / wpull
Wget-compatible web downloader and crawler.
☆611 · Apr 29, 2024 · Updated 2 years ago
alard / wget-lua
Wget with Lua extension
☆24 · Dec 17, 2015 · Updated 10 years ago
ruarxive / awesome-digital-preservation
Awesome list dedicated to digital and data preservation tools, sources, services and so on.
☆37 · Mar 9, 2026 · Updated 4 months ago
ArchiveTeam / grab-site
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
☆1,601 · May 23, 2025 · Updated last year
webrecorder / browsertrix-crawler
Run a high-fidelity browser-based web archiving crawler in a single Docker container
☆1,087 · Updated this week
ArchiveTeam / terroroftinytown-client-grab
The Seesaw pipeline grab script for the URLTeam (terroroftinytown) project
☆28 · Jul 17, 2025 · Updated last year
webrecorder / specs
Specifications developed and maintained by the Webrecorder community.
☆142 · Oct 16, 2025 · Updated 9 months ago
alard / megawarc
Nondestructive warc-in-tar to warc conversion
☆27 · Apr 21, 2013 · Updated 13 years ago
unprovable / PastyPass
Browser agnostic extension that enables pasting into password fields...
☆15 · Jan 12, 2020 · Updated 6 years ago
internetarchive / gowarc
Read and write WARC files in Go
☆53 · Jul 14, 2026 · Updated last week
nla / outbackcdx
Web archive index server based on RocksDB
☆43 · Jul 9, 2026 · Updated last week
rurban / perl-hash-stats
Counting the collisions with perl hash tables per function
☆12 · Jun 5, 2019 · Updated 7 years ago
oduwsdl / archivenow
A Tool To Push Web Resources Into Web Archives
☆434 · Jan 23, 2024 · Updated 2 years ago
ArchiveTeam / urls-grab
Archiving URLs (outlinks) from a variety of sources.
☆25 · Jun 26, 2026 · Updated 3 weeks ago
internetarchive / sandcrawler
Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki
☆28 · Jul 31, 2024 · Updated last year
ArchiveTeam / universal-tracker
A configurable, reusable tracker with dashboard
☆36 · Dec 15, 2023 · Updated 2 years ago
webrecorder / web-archive-site-mirror
☆17 · Apr 16, 2026 · Updated 3 months ago
overcast07 / wayback-machine-spn-scripts
Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now
☆145 · Apr 3, 2025 · Updated last year
michaelmaniscalco / m99
novel high throughput entropy encoder for BWT data
☆15 · Aug 10, 2022 · Updated 3 years ago
Nold360 / docker-grab-site
Docker Container for grab-site
☆13 · Aug 26, 2024 · Updated last year
webrecorder / pywb-remote-browsers
Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives
☆16 · Jun 10, 2021 · Updated 5 years ago
webrecorder / har2warc
Convert HTTP Archive (HAR) -> Web Archive (WARC) format
☆55 · Oct 21, 2018 · Updated 7 years ago
ArchiveBox / pip-archivebox
Official Python package for ArchiveBox, the self-hosted internet archiving solution.
☆12 · Oct 5, 2024 · Updated last year
ArchiveTeam / WebArchiver
Decentralized web archiving
☆20 · Aug 7, 2018 · Updated 7 years ago
LegacyUpdate / Unstrike
Automatically rescue a Windows 10 or 11 installation affected by the 19 July 2024 CrowdStrike Falcon crash
☆10 · Jul 22, 2024 · Updated last year
ArchiveBox / debian-archivebox
Home of the official apt/deb package for Ubuntu/Debian-based systems.
☆17 · Updated this week
N0taN3rd / simplechrome
Webrecorders DevTools Protocol Automation Library
☆18 · Oct 18, 2022 · Updated 3 years ago
openzim / warc2zim
Command line tool to convert a file in the WARC format to a file in the ZIM format
☆86 · Mar 30, 2026 · Updated 3 months ago
tree-sitter / fuzz-action
Input fuzzing action for tree-sitter parsers
☆18 · Jul 11, 2025 · Updated last year
webrecorder / warcit
Convert Directories, Files and ZIP Files to Web Archives (WARC)
☆99 · Apr 22, 2025 · Updated last year
ArchiveBox / internet-archiving-talk
🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.
☆15 · Oct 19, 2020 · Updated 5 years ago
ArchiveTeam / ArchiveBot
ArchiveBot, an IRC bot for archiving websites
☆418 · Apr 17, 2026 · Updated 3 months ago
yogeshwaran01 / update-my-spotify-playlist
Automatically update your Spotify playlist with favorite tracks of your favorite artists and genres
☆10 · Jun 11, 2023 · Updated 3 years ago
harvard-lil / thread-keeper
(Experimental) High-fidelity capture of Twitter threads as sealed PDFs.
☆55 · Dec 4, 2023 · Updated 2 years ago
oldweb-today / remote-desktop-server
A set of Docker images for streaming a remote desktop video and audio
☆27 · May 15, 2023 · Updated 3 years ago
webrecorder / behaviors
Webrecorder Automated In-Page Behavior Framework
☆13 · Apr 21, 2021 · Updated 5 years ago
nauful / NLZM
Dictionary compressor with nibbled ANS and optimal parsing. Other compression experiments.
☆25 · Apr 13, 2025 · Updated last year