nla / httrack2warcLinks
Converts HTTrack crawls to WARC files
☆33Updated last year
Alternatives and similar repositories for httrack2warc
Users that are interested in httrack2warc are comparing it to the libraries listed below
Sorting:
- ArchiveBoxMatic: configure ArchiveBox with the simplicity of a yaml file.☆14Updated 4 years ago
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.☆13Updated 10 months ago
- Archiving public telegram messages.☆13Updated this week
- Home of the official apt/deb package for Ubuntu/Debian-based systems.☆17Updated 10 months ago
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆57Updated 11 months ago
- Homebrew formula for the ArchiveBox self-hosted internet archiving solution.☆28Updated 10 months ago
- Recover lost websites from the Web Infrastructure☆89Updated 4 years ago
- A list of things related to software, literature, and other content for 🕣 Memento☆99Updated last year
- A server to collect & archive websites that also supports video downloads☆86Updated 2 years ago
- ☆11Updated 3 years ago
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆15Updated 4 years ago
- Scrape https://unlistedvideos.com/☆15Updated 4 years ago
- The (new) discovery backend for https://odcrawler.xyz☆32Updated 2 years ago
- Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now☆131Updated 4 months ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac…☆54Updated last month
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Arch…☆19Updated last year
- Source for the Github Wiki / ReadTheDocs documentation for AchiveBox, the self-hosted internet archiving solution.☆15Updated last week
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page…☆40Updated 10 months ago
- [mirror] Backup a list of github starred repositories for the specified user.☆139Updated 2 years ago
- Scripts to build and boot warrior virtual machine containing Docker☆119Updated 4 months ago
- 🔒📈 Host file tools written in rust.