richardlehane / webarchive
golang readers for ARC and WARC webarchive formats
☆20 · Apr 3, 2023 · Updated 2 years ago
Alternatives and similar repositories for webarchive
Users who are interested in webarchive are comparing it to the libraries listed below.
- Read and write WARC files in Go · ☆48 · Apr 9, 2018 · Updated 7 years ago
- ☆17 · Mar 31, 2025 · Updated 10 months ago
- A Memento Aggregator CLI and Server in Go · ☆76 · Mar 4, 2025 · Updated 11 months ago
- A golang library to work with WARC files from the common crawl · ☆15 · Feb 20, 2018 · Updated 7 years ago
- A tool for collection archival slivers of the web and web archives · ☆17 · Feb 18, 2025 · Updated 11 months ago
- Golang WARC (Web ARChive) Library · ☆30 · Aug 6, 2019 · Updated 6 years ago
- An evil web server. · ☆13 · May 9, 2015 · Updated 10 years ago
- ☆11 · Nov 21, 2025 · Updated 2 months ago
- visualizations/charts for media collections, based on mediainfo · ☆14 · Sep 15, 2022 · Updated 3 years ago
- Object Resource Stream and CDXJ Drafts · ☆14 · Nov 28, 2018 · Updated 7 years ago
- A service that provides archive-aware oEmbed-compatible embeddable surrogates (social cards, thumbnails, etc.) for archived web pages (me… · ☆14 · Nov 15, 2021 · Updated 4 years ago
- DASL — Data-Addressed Structures & Links · ☆17 · Feb 4, 2026 · Updated last week
- OCFL implementation for Go · ☆16 · Feb 6, 2026 · Updated last week
- A mini LDP Server written in Go. · ☆11 · Sep 14, 2016 · Updated 9 years ago
- Java library implementing the IIIF Presentation API · ☆15 · Feb 28, 2018 · Updated 7 years ago
- https://arxiv.org/html/2402.02668v2 · ☆21 · Apr 23, 2025 · Updated 9 months ago
- JavaScript module and CLI tool for working with web archive data using the WACZ format specification. · ☆16 · Mar 11, 2025 · Updated 11 months ago
- A Memento TimeGate · ☆44 · May 4, 2020 · Updated 5 years ago
- dataset is a command line tool, Go package, shared library and Python package for working with JSON objects as collections · ☆24 · Aug 6, 2025 · Updated 6 months ago
- A CLI for OCFL repositories · ☆20 · Jan 25, 2026 · Updated 2 weeks ago
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX) · ☆25 · Oct 9, 2017 · Updated 8 years ago
- Automatic RESTful OpenAPI server from a SQLite database. · ☆29 · Jul 31, 2023 · Updated 2 years ago
- Easily add authentication to your postgrest API · ☆25 · Feb 14, 2019 · Updated 6 years ago
- OAI-PMH plugin for Solr · ☆23 · May 12, 2021 · Updated 4 years ago
- ☆27 · Oct 14, 2022 · Updated 3 years ago
- Service for creating Twitter datasets for research and archiving. · ☆26 · Dec 7, 2022 · Updated 3 years ago
- Automatically archive your repository's GitHub Pages in the Wayback Machine. · ☆28 · Jan 30, 2024 · Updated 2 years ago
- mirror a website, put it in a bag · ☆24 · Dec 18, 2022 · Updated 3 years ago
- CDXJ Indexing of WARC/ARCs · ☆32 · Dec 10, 2024 · Updated last year
- wabac.js - Web Archive Browsing Augmentation Client · ☆122 · Updated this week
- This project has been archived and is no longer being developed or supported. The Curator's Workbench is an extensible digital collectio… · ☆24 · Jun 25, 2020 · Updated 5 years ago
- Archive Research Services Workshop · ☆31 · Sep 29, 2017 · Updated 8 years ago
- A simple demonstration of building a Retrieval Augmented Generation (RAG) system using SQLite and Ollama for local, on-device vector sear… · ☆37 · Nov 12, 2024 · Updated last year
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages. · ☆39 · Nov 24, 2025 · Updated 2 months ago
- Save pages to the Wayback Machine as part of your CI/CD pipeline · ☆32 · Updated this week
- IIIF Presentation API implementation in Python · ☆35 · Apr 17, 2024 · Updated last year
- Trough: Big data, small databases. · ☆41 · Jul 25, 2024 · Updated last year
- Web archive index server based on RocksDB · ☆38 · Feb 1, 2026 · Updated last week
- Parses BGP/AS data from multiple different sources · ☆11 · Dec 4, 2021 · Updated 4 years ago