internetarchive / warctools
Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)
☆159Updated 4 years ago
Alternatives and similar repositories for warctools:
Users that are interested in warctools are comparing it to the libraries listed below
- Tool and library for handling Web ARChive (WARC) files.☆156Updated 6 months ago
- Centralised repository for WARC usage specifications.☆109Updated 4 months ago
- Python library for reading and writing warc files☆239Updated 3 years ago
- WARC and ARC indexing and discovery tools.☆123Updated last month
- NOTE: This project is no longer being actively developed.. Check out https://replayweb.page / https://github.com/webrecorder/replayweb.pa…☆201Updated 2 months ago
- Serving content from a WARC☆61Updated 12 years ago
- A list of things related to software, literature, and other content for 🕣 Memento☆97Updated 10 months ago
- Converts WARC files to static HTML☆44Updated 9 months ago
- Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head☆170Updated 4 years ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC)☆85Updated 3 weeks ago
- A collection of tools for archiving and analysing the internet.