nla / httrack2warcLinks
Converts HTTrack crawls to WARC files
β33Updated last year
Alternatives and similar repositories for httrack2warc
Users that are interested in httrack2warc are comparing it to the libraries listed below
Sorting:
- π An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.β59Updated last year
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.β13Updated last year
- A collection of tools for archiving and analysing the internet.β78Updated 3 years ago
- π An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.β15Updated 5 years ago
- Command line tool to convert a file in the WARC format to a file in the ZIM formatβ75Updated 8 months ago
- [mirror] Backup a list of github starred repositories for the specified user.β142Updated 2 years ago
- A list of things related to software, literature, and other content for π£ Mementoβ102Updated last year
- Homebrew formula for the ArchiveBox self-hosted internet archiving solution.β28Updated last year
- Bash scripts which interact with Internet Archive Wayback Machine's Save Page Nowβ133Updated 7 months ago
- Scripts to build and boot warrior virtual machine containing Dockerβ122Updated 7 months ago
- β11Updated 3 years ago
- Server and bookmarklet to download files via youtube-dl directly from your browser. Cross platform single binary installation, web browseβ¦β80Updated 6 months ago
- A server to collect & archive websites that also supports video downloadsβ85Updated 2 years ago
- Recover lost websites from the Web Infrastructureβ89Updated 3 months ago
- Strip advertisements from downloaded YouTube videosβ60Updated 4 years ago
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.β130Updated 3 months ago
- Archiving public telegram messages.β16Updated 3 months ago
- A youtube-dl extension with pluggable extractorsβ53Updated 7 months ago
- Home of the official apt/deb package for Ubuntu/Debian-based systems.β17Updated last year
- The ArchiveWeb.page Siteβ30Updated 3 weeks ago
- A command line tool to archive a git repository from GitHub to the Internet Archive.β92Updated 4 years ago
- URLTeam's second generation of URL shortener archiving toolsβ79Updated 2 months ago
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)β167Updated 3 months ago
- foobar2000 plugin to submit listen history to your Maloja serverβ12Updated 2 years ago
- Searches the internet for DDL links and sends them to your favourite download managerβ62Updated 6 years ago
- Tool and library for handling Web ARChive (WARC) files.β165Updated last year
- Conifer setup and deployment via Ansibleβ12Updated 5 years ago
- PRE & NFO database and notification service for warez scene releases. This repository contains the frontend code written in Next.js and Cβ¦β31Updated 4 years ago
- download books from archive.orgβ32Updated last year
- Specifications developed and maintained by the Webrecorder community.β136Updated last month