nla / httrack2warcLinks
Converts HTTrack crawls to WARC files
β33Updated last year
Alternatives and similar repositories for httrack2warc
Users that are interested in httrack2warc are comparing it to the libraries listed below
Sorting:
- Bash scripts which interact with Internet Archive Wayback Machine's Save Page Nowβ138Updated 10 months ago
- π An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.β58Updated last year
- a gui for TRID ( http://mark0.net/soft-trid-e.html )β21Updated 9 years ago
- A youtube-dl extension with pluggable extractorsβ53Updated 9 months ago
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.β12Updated last year
- Scripts to build and boot warrior virtual machine containing Dockerβ122Updated 10 months ago
- A server to collect & archive websites that also supports video downloadsβ84Updated 2 years ago
- Strip advertisements from downloaded YouTube videosβ61Updated 4 years ago
- Home of the official apt/deb package for Ubuntu/Debian-based systems.β16Updated last year
- π An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.β15Updated 5 years ago
- wpull fork with fixes and faster parsing using html5-parser; used by grab-site; should go away when wpull is similarly improvedβ30Updated 4 months ago
- PRE & NFO database and notification service for warez scene releases. This repository contains the frontend code written in Next.js and Cβ¦β32Updated 4 years ago
- Archiving public telegram messages.β17Updated 3 weeks ago
- [mirror] Backup a list of github starred repositories for the specified user.β144Updated 2 years ago
- Homebrew formula for the ArchiveBox self-hosted internet archiving solution.β28Updated last year
- Server and bookmarklet to download files via youtube-dl directly from your browser. Cross platform single binary installation, web browseβ¦β80Updated 8 months ago
- β11Updated 4 years ago
- Simultaneous, resumable and hash-verified downloads from Internet Archive (archive.org)β175Updated 2 years ago
- Mozilla LZ4 File Decryption and Mining Toolsβ38Updated 8 months ago
- Missing addon manager for firefoxβ17Updated 2 years ago
- download books from archive.orgβ32Updated last year
- One button to close any overlay on any websiteβ11Updated 3 years ago
- Command line tool to convert a file in the WARC format to a file in the ZIM formatβ75Updated 2 weeks ago
- Scrape https://unlistedvideos.com/β15Updated 4 years ago
- A yt-dlp extractor plugin to bypass YouTube age-gateβ66Updated 11 months ago
- Wayback Machine Downloader. π₯ Download your entire archived websites from the Internet Archive Wayback Machine.β101Updated 3 years ago
- A cookie manager, browser add-on to manage and flag cookies and session data. On stereoids.β96Updated 7 months ago
- Python 3 tools for downloading and preserving wikisβ127Updated 3 weeks ago
- Firefox addon to save images from open tabsβ48Updated 3 weeks ago
- foobar2000 plugin to submit listen history to your Maloja serverβ13Updated 2 years ago