Converts HTTrack crawls to WARC files
☆34Aug 6, 2024Updated last year
Alternatives and similar repositories for httrack2warc
Users that are interested in httrack2warc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.☆12Oct 5, 2024Updated last year
- Web archive index server based on RocksDB☆42Apr 1, 2026Updated last month
- A prototype server to swarm multiple DATs for Webrecorder☆14Apr 27, 2019Updated 7 years ago
- CDXJ Indexing of WARC/ARCs☆34Apr 22, 2026Updated last week
- ☆30Jun 6, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Nondestructive warc-in-tar to warc conversion☆27Apr 21, 2013Updated 13 years ago
- url canonicalization library for python and java☆40May 22, 2022Updated 3 years ago
- Merges HOSTS files☆12Dec 19, 2025Updated 4 months ago
- Index Filesystem for FUSE☆17Dec 15, 2021Updated 4 years ago
- A command line utility for listing and searching snapshots in web archives☆17Dec 21, 2023Updated 2 years ago
- Fetch git-annex metadata from IMDB☆10Feb 10, 2018Updated 8 years ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC)☆97Apr 22, 2025Updated last year
- ██████╗ ███████╗██████╗ ██╔══██╗██╔════╝██╔══██╗ ██████╔╝█████╗ ██║ ██║ ██╔══██╗██╔══╝ ██║ ██║ ██║ ██║███████╗██████╔╝ ╚═╝ ╚═╝╚═══…☆11Feb 17, 2022Updated 4 years ago
- utility to create an element from a simple CSS selector☆13Aug 1, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Command line tool for digging into WARC files☆49Apr 15, 2026Updated 2 weeks ago
- 🗄 Save an archived copy of websites from Pocket/Pinboard/Bookmarks/RSS. Outputs HTML, PDFs, and more...☆38Aug 12, 2018Updated 7 years ago
- A server to collect & archive websites that also supports video downloads☆84Feb 11, 2023Updated 3 years ago
- Create "perfect" snapshots of web pages☆34Apr 10, 2026Updated 3 weeks ago
- code and data used to build a training dataset for dragnet models☆10Nov 29, 2020Updated 5 years ago
- ☆16Dec 13, 2014Updated 11 years ago
- Up to date information which trackers have freeleech at the moment☆19Updated this week
- 404Games Wastelands V2 - Chernarus☆25Jun 25, 2013Updated 12 years ago
- Specifications developed and maintained by the Webrecorder community.☆139Oct 16, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Support for writing WARC files with Scrapy☆24Dec 21, 2019Updated 6 years ago
- Verifiable Credential Extensions☆12Feb 12, 2025Updated last year
- One-Click User Instigated Preservation☆128Feb 3, 2019Updated 7 years ago
- A client library for interacting with the Gogs REST api.☆13Apr 30, 2019Updated 7 years ago
- Zig UEFI FreeType Demo☆17Sep 25, 2019Updated 6 years ago
- Conifer setup and deployment via Ansible☆12Jun 15, 2020Updated 5 years ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.☆48Mar 19, 2018Updated 8 years ago
- Miscellaneous tools for processing WARC files from the CommonCrawl☆25Jan 1, 2014Updated 12 years ago
- ☆58Apr 11, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A simple 404 page that uses the pathname as input to generate a 404 message.☆13Apr 28, 2018Updated 8 years ago
- Material for my React Fundamentals Workshop☆15Dec 27, 2022Updated 3 years ago
- Standard implementation of TRC404☆10Jan 20, 2025Updated last year
- CaddyServer module for processing images on the fly.☆14Nov 24, 2025Updated 5 months ago
- The Zonemaster Backend - part of the Zonemaster project☆16Dec 19, 2025Updated 4 months ago
- 同程巡风项目Docker镜像版☆24Dec 22, 2016Updated 9 years ago
- CVE-2021-40438 exploit PoC with Docker setup.☆12Oct 24, 2021Updated 4 years ago