chfoo / warcat-rsLinks
Command-line tool and Rust library for handling Web ARChive (WARC) files
β26Updated 8 months ago
Alternatives and similar repositories for warcat-rs
Users that are interested in warcat-rs are comparing it to the libraries listed below
Sorting:
- β56Updated last year
- π¨ High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.β187Updated 5 months ago
- Command line tool for digging into WARC filesβ50Updated last week
- A tool for collection archival slivers of the web and web archivesβ17Updated 11 months ago
- Create and edit WARC and WACZ filesβ23Updated last year
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.β39Updated 2 months ago
- β16Updated 4 months ago
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.β24Updated 2 years ago
- Download and attach provenance to public datasetsβ37Updated 10 months ago
- Centralised repository for WARC usage specifications.β124Updated 3 months ago
- A command line utility for listing and searching snapshots in web archivesβ17Updated 2 years ago
- Specifications developed and maintained by the Webrecorder community.β140Updated 3 months ago
- Downloads and imports Wikipedia page histories to a git repositoryβ35Updated 3 months ago
- A Memento Aggregator CLI and Server in Goβ76Updated 11 months ago
- Save data from Mastodon to a SQLite databaseβ29Updated 7 months ago
- Static Site Generator for Viewing Web Archives (in WACZ) formatβ29Updated 2 years ago
- A simple Python wrapper and command-line interface for archive.orgβs "Save Page Now" capturing serviceβ188Updated 3 weeks ago
- A Github Action for turning Markdown into ReSpec HTMLβ15Updated last year
- Document your SQLite tables and columns with in-line commentsβ24Updated 2 years ago
- CDXJ Indexing of WARC/ARCsβ32Updated last year
- (Experimental) High-fidelity capture of Twitter threads as sealed PDFs.β55Updated 2 years ago
- Web archive index server based on RocksDBβ38Updated last week
- A fun tool for quickly browsing unsourced snippets on Wikipedia.β116Updated last week
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)β169Updated 5 months ago
- Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in tβ¦β132Updated 2 months ago
- Converts WARC files to static HTMLβ51Updated 4 months ago
- β16Updated 9 months ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC)β92Updated 9 months ago
- wabac.js - Web Archive Browsing Augmentation Clientβ122Updated last week
- A search interface and wayback machine for the UKWA Solr based warc-indexer framework.β134Updated last week