chfoo / warcat-rsLinks
Command-line tool and Rust library for handling Web ARChive (WARC) files
β25Updated 5 months ago
Alternatives and similar repositories for warcat-rs
Users that are interested in warcat-rs are comparing it to the libraries listed below
Sorting:
- π¨ High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.β180Updated 2 months ago
- (Experimental) High-fidelity capture of Twitter threads as sealed PDFs.β55Updated last year
- β54Updated last year
- Command line tool for digging into WARC filesβ47Updated this week
- Specifications developed and maintained by the Webrecorder community.β136Updated last month
- A tool for collection archival slivers of the web and web archivesβ16Updated 9 months ago
- Downloads and imports Wikipedia page histories to a git repositoryβ35Updated 2 weeks ago
- Download and attach provenance to public datasetsβ36Updated 8 months ago
- Create and edit WARC and WACZ filesβ18Updated 11 months ago
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.β38Updated this week
- A command line utility for listing and searching snapshots in web archivesβ17Updated last year
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.β24Updated last year
- Converts WARC files to static HTMLβ49Updated 2 months ago
- A fun tool for quickly browsing unsourced snippets on Wikipedia.β112Updated 3 weeks ago
- A Memento Aggregator CLI and Server in Goβ71Updated 8 months ago
- Document your SQLite tables and columns with in-line commentsβ24Updated 2 years ago
- A simple Python wrapper and command-line interface for archive.orgβs "Save Page Now" capturing serviceβ188Updated last year
- Centralised repository for WARC usage specifications.β119Updated last month
- CDXJ Indexing of WARC/ARCsβ30Updated 11 months ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC)β89Updated 7 months ago
- β13Updated last month
- A command line tool to archive a git repository from GitHub to the Internet Archive.β92Updated 4 years ago
- Save data from Mastodon to a SQLite databaseβ28Updated 4 months ago
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)β167Updated 3 months ago
- Archiving parts of the US government.β28Updated 3 months ago
- Static Site Generator for Viewing Web Archives (in WACZ) formatβ28Updated 2 years ago
- Find possible host names in a source textβ53Updated 3 years ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.β52Updated this week
- Some examples for Mastodon.pyβ18Updated 8 months ago
- Platform for journalists to search, analyse, categorise and share unstructured dataβ56Updated last week