richardlehane / webarchive
golang readers for ARC and WARC webarchive formats
☆20Updated last year
Related projects ⓘ
Alternatives and complementary repositories for webarchive
- Golang WARC (Web ARChive) Library☆29Updated 5 years ago
- Read and write WARC files in Go☆41Updated 6 years ago
- A golang library to work with WARC files from the common crawl☆14Updated 6 years ago
- Miscellaneous tools for processing WARC files from the CommonCrawl☆22Updated 10 years ago
- ☆27Updated 7 years ago
- RAIS: A IIIF-compliant, 100% open source image server for blazing-fast deep zooming☆78Updated 3 weeks ago
- A Memento Aggregator CLI and Server in Go☆57Updated 6 months ago
- OCFL implementation for Go☆13Updated this week
- Span formats.☆17Updated 2 weeks ago
- Digital Preservation of HTTP in documentary heritage.☆22Updated last year
- language-agnostic syntax for documenting & outlining packages☆10Updated 6 months ago
- (Note: This repository is obsolete, please see the new Browsertrix webrecorder/browsertrix) Browser-Based On-Demand Web Archiving Automat…☆39Updated 5 years ago
- A JSON-LD processor for Go☆114Updated 5 years ago
- A listing of world wide web archives, for humans and machines using Web Archive Manifest (WAM) yaml format☆43Updated last year
- A mini LDP Server written in Go.☆11Updated 8 years ago
- Nifty library to manage, query and store RDF triples. Make RDF great again!☆115Updated 5 years ago
- Go package to implement the IIIF Image API.☆89Updated 3 months ago
- Convert HTTP Archive (HAR) -> Web Archive (WARC) format☆46Updated 6 years ago
- Go package to parse GEDCOM files.☆37Updated 2 months ago
- mediawiki dump parser for loading up wikipedia data☆101Updated 11 months ago
- Go library used to flip text☆18Updated last year
- Command line tool for digging into WARC files☆34Updated 3 weeks ago
- Package valuegraph produces a graph representation of any Go value.☆32Updated 6 years ago
- An IETF Internet Draft for the Multihash data format☆10Updated last year
- Package goling provides natural language processing tools.☆39Updated 7 years ago
- oldweb.today Remote/Containerized Browser System☆10Updated 5 years ago
- A fast, responsive HTML5 viewer for scanned items, developed for the World Digital Library. A project of the Library of Congress. Note: p…☆22Updated 9 years ago
- Email is a command line program that can send attachments, and stdin as the body of an email.☆13Updated 9 years ago
- Command line tool for querying and retrieving records from OAI-PMI providers.☆10Updated 7 years ago
- Check out https://github.com/webrecorder/webrecorder for newer version matching https://webrecorder.io☆39Updated 9 years ago