chfoo / warcat-rsLinks
Command-line tool and Rust library for handling Web ARChive (WARC) files
☆20Updated 3 weeks ago
Alternatives and similar repositories for warcat-rs
Users that are interested in warcat-rs are comparing it to the libraries listed below
Sorting:
- Command line tool for digging into WARC files☆40Updated 2 weeks ago
- A tool for collection archival slivers of the web and web archives☆13Updated 4 months ago
- ☆10Updated 3 years ago
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.☆32Updated last month
- A tool for detecting viruses and NSFW material in WARC files☆15Updated 10 months ago
- A command line utility for listing and searching snapshots in web archives☆16Updated last year
- 🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en…☆18Updated 3 months ago
- Save data from Mastodon to a SQLite database☆28Updated last year
- Static Site Generator for Viewing Web Archives (in WACZ) format☆27Updated last year
- ☆13Updated 2 months ago
- A Github Action for turning Markdown into ReSpec HTML☆14Updated last year
- ☆46Updated last year
- Create and edit WARC and WACZ files☆10Updated 6 months ago
- Specifications developed and maintained by the Webrecorder community.☆131Updated 5 months ago
- Follow the cryptocurrency industry’s influence on 2024 elections in the United States.☆33Updated last week
- A ServiceWorker for client-side reconstruction of composite mementos☆15Updated 3 months ago
- 🍨 High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.☆162Updated last week
- ☆11Updated 3 months ago
- Downloads and imports Wikipedia page histories to a git repository☆35Updated 6 months ago
- CDXJ Indexing of WARC/ARCs☆26Updated 6 months ago
- Download and attach provenance to public datasets☆33Updated 2 months ago
- Web application for distributed compute analysis of Archive-It web archive collections.☆19Updated 3 months ago
- Python IMage MIning☆14Updated 3 months ago
- search interface for scholarly works☆85Updated 10 months ago
- Convert PDF to fixed-layout EPUB, conserving the table of contents, inner cross-references and hyperlinks.☆37Updated 6 months ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC)☆85Updated 2 months ago
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.☆24Updated last year
- JavaScript module and CLI tool for working with web archive data using the WACZ format specification.☆14Updated 3 months ago
- Collect links to read or watch later in your RSS reader.☆80Updated 7 months ago
- Specifications for Fediverse Auxiliary Service Providers☆73Updated 3 weeks ago