nla / httrack2warc
Converts HTTrack crawls to WARC files
β32Updated 5 months ago
Alternatives and similar repositories for httrack2warc:
Users that are interested in httrack2warc are comparing it to the libraries listed below
- ArchiveBoxMatic: configure ArchiveBox with the simplicity of a yaml file.β14Updated 3 years ago
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.β13Updated 3 months ago
- π An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.β52Updated 5 months ago
- Bash scripts which interact with Internet Archive Wayback Machine's Save Page Nowβ114Updated 4 months ago
- Archiving public telegram messages.β12Updated 3 weeks ago
- Homebrew formula for the ArchiveBox self-hosted internet archiving solution.β28Updated 3 months ago
- simple script to convert web resources to a single warc fileβ19Updated last year
- Home of the official apt/deb package for Ubuntu/Debian-based systems.β17Updated 3 months ago
- Command line tool to convert a file in the WARC format to a file in the ZIM formatβ49Updated 3 weeks ago
- PRE & NFO database and notification service for warez scene releases. This repository contains the frontend code written in Next.js and Cβ¦β31Updated 3 years ago
- Conifer setup and deployment via Ansibleβ12Updated 4 years ago
- π An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.β14Updated 4 years ago
- A youtube-dl extension with pluggable extractorsβ46Updated last month
- Copy all Google Fonts to a folderβ10Updated 6 years ago
- A yt-dlp extractor plugin to bypass YouTube age-gateβ49Updated 2 years ago
- A server to collect & archive websites that also supports video downloadsβ85Updated last year
- Archiving URLs (outlinks) from a variety of sources.β19Updated last week
- Electromagnet electrifies your torrents by automatically adding lots of stable trackers from newTrackon to all magnet links as you browseβ¦β30Updated 4 years ago
- A yt-dlp extractor plugin to decrypt YouTube nsig using Deno https://deno.landβ19Updated last month
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.β108Updated 3 weeks ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.β37Updated 2 weeks ago
- β11Updated 3 years ago
- Encode/decode binary data over a live streaming video in real time.β13Updated last year
- Scripts for backing up your posts, likes and media files from Tumblrβ20Updated 5 years ago
- Simple IPFS-based file sharing, modified from pomf.se (RIP in pieces)β10Updated 7 years ago
- Post-processor plugin to use DeArrow video titles in YT-DLPβ11Updated last year
- Stig's Art Grabr is a userscript for grabbing high resolution album cover-art from various sites - Can also be used as a bookmarklet/faveβ¦β14Updated last year
- Download documents from issuu.com.β19Updated last year
- Browser userscript to clean up hyperlink redirections and link shimsβ19Updated 2 years ago