nla / httrack2warc
Converts HTTrack crawls to WARC files
☆33 Updated 11 months ago
Alternatives and similar repositories for httrack2warc
Users who are interested in httrack2warc are comparing it to the libraries listed below.
- Archiving public telegram messages. ☆13 Updated this week
- ☆11 Updated 3 years ago
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces. ☆56 Updated 11 months ago
- ArchiveBoxMatic: configure ArchiveBox with the simplicity of a yaml file. ☆14 Updated 4 years ago
- Official Python package for ArchiveBox, the self-hosted internet archiving solution. ☆13 Updated 9 months ago
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication. ☆125 Updated 6 months ago
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces. ☆15 Updated 4 years ago
- A server to collect & archive websites that also supports video downloads ☆86 Updated 2 years ago
- The (new) discovery backend for https://odcrawler.xyz ☆30 Updated 2 years ago
- Home of the official apt/deb package for Ubuntu/Debian-based systems. ☆17 Updated 9 months ago
- Conifer setup and deployment via Ansible ☆12 Updated 5 years ago
- automatically check all downloads against 68 anti-virus solutions (through VirusTotal API) ☆19 Updated 4 years ago
- 🚀 Stormhen (based on Mozilla Thunderbird) portable for Windows ☆21 Updated 3 months ago
- Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now ☆129 Updated 3 months ago
- Grabbing everything from reddit. ☆61 Updated last year
- Scrape https://unlistedvideos.com/ ☆15 Updated 4 years ago
- Set of miscellaneous scripts for personal use. ☆14 Updated last year
- Strip advertisements from downloaded YouTube videos ☆59 Updated 3 years ago
- [mirror] Backup a list of github starred repositories for the specified user. ☆138 Updated 2 years ago
- Maximize upload while minimizing download to achieve high share ratios ☆11 Updated 4 months ago
- Homebrew formula for the ArchiveBox self-hosted internet archiving solution. ☆28 Updated 9 months ago
- Server and bookmarklet to download files via youtube-dl directly from your browser. Cross platform single binary installation, web browse… ☆76 Updated last month
- Fake Seeder for Torrent ☆12 Updated 5 years ago
- plugin manager for yt-dlp which enables releases of extractors as separate python package ☆56 Updated 2 months ago
- Recover lost websites from the Web Infrastructure ☆89 Updated 4 years ago
- Mozilla LZ4 File Decryption and Mining Tools ☆37 Updated 2 months ago
- The ArchiveWeb.page Site ☆31 Updated 7 months ago
- Command line tool to convert a file in the WARC format to a file in the ZIM format ☆61 Updated 4 months ago
- Copy all Google Fonts to a folder ☆10 Updated 6 years ago
- A list of things related to software, literature, and other content for 🕣 Memento ☆99 Updated last year