Converts HTTrack crawls to WARC files
☆34Aug 6, 2024Updated last year
Alternatives and similar repositories for httrack2warc
Users who are interested in httrack2warc are comparing it to the libraries listed below.
- Web archive index server based on RocksDB☆43May 1, 2026Updated 2 weeks ago
- CDXJ Indexing of WARC/ARCs☆34May 11, 2026Updated last week
- ☆30Jun 6, 2024Updated last year
- Nondestructive warc-in-tar to warc conversion☆27Apr 21, 2013Updated 13 years ago
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆57Aug 15, 2024Updated last year
- Scripts for FFmpeg☆18Sep 9, 2023Updated 2 years ago
- Fetch git-annex metadata from IMDB☆10Feb 10, 2018Updated 8 years ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC)☆98Apr 22, 2025Updated last year
- Command line tool for digging into WARC files☆49May 9, 2026Updated last week
- Diff two unist trees☆14Aug 21, 2020Updated 5 years ago
- Parse And Create Web ARChive (WARC) files with node.js☆104Jan 29, 2025Updated last year
- 🗄 Save an archived copy of websites from Pocket/Pinboard/Bookmarks/RSS. Outputs HTML, PDFs, and more...☆38Aug 12, 2018Updated 7 years ago
- A server to collect & archive websites that also supports video downloads☆85Feb 11, 2023Updated 3 years ago
- Clone of https://git.kernel.org/pub/scm/linux/kernel/git/jejb/sbsigntools.git/ with patches for yubikey support☆10Aug 14, 2020Updated 5 years ago
- JS Streaming WARC IO optimized for Browser and Node☆52Mar 25, 2026Updated last month
- ☆16Dec 13, 2014Updated 11 years ago
- 404Games Wastelands V2 - Chernarus☆25Jun 25, 2013Updated 12 years ago
- Support for writing WARC files with Scrapy☆24Dec 21, 2019Updated 6 years ago
- Verifiable Credential Extensions☆12Feb 12, 2025Updated last year
- A client library for interacting with the Gogs REST api.☆13Apr 30, 2019Updated 7 years ago
- Conifer setup and deployment via Ansible☆12Jun 15, 2020Updated 5 years ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.☆48Mar 19, 2018Updated 8 years ago
- Run a high-fidelity browser-based web archiving crawler in a single Docker container☆1,039May 13, 2026Updated last week
- ☆59Apr 11, 2024Updated 2 years ago
- Write-ups of the reasoning used to solve the challenges of the 404CTF 2023☆10Dec 16, 2023Updated 2 years ago
- CaddyServer module for processing images on the fly.☆15Nov 24, 2025Updated 5 months ago
- A default backend (404 page) for nginx-ingress in Kubernetes☆13Jan 23, 2018Updated 8 years ago
- The Zonemaster Backend - part of the Zonemaster project☆16Dec 19, 2025Updated 5 months ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆25Oct 9, 2017Updated 8 years ago
- PlayStation GPU (WIP)☆18Oct 3, 2023Updated 2 years ago
- Docker image version of the Tongcheng (同程) Xunfeng (巡风) project☆24Dec 22, 2016Updated 9 years ago
- CVE-2021-40438 exploit PoC with Docker setup.☆12Oct 24, 2021Updated 4 years ago
- ☆16Sep 9, 2021Updated 4 years ago
- A Simple C++ based CSSParser☆18May 13, 2026Updated last week
- Webrecorder Player for Desktop (OSX/Windows/Linux). (Built with Electron + Webrecorder)☆446Sep 17, 2020Updated 5 years ago
- Tools to Work with the Web Archive Ecosystem in R☆20Aug 20, 2017Updated 8 years ago
- Parse WARC (Web Archive Files) as a node.js stream☆23Oct 20, 2014Updated 11 years ago
- Simple tool that removes link masking/tracking and optionally resolves shortened links.☆28Oct 11, 2022Updated 3 years ago
- u-boot addon image for the AVM FritzBox 4040☆12Mar 8, 2026Updated 2 months ago