richardlehane / webarchive
golang readers for ARC and WARC webarchive formats
☆20 Updated 2 years ago
Alternatives and similar repositories for webarchive
Users who are interested in webarchive are comparing it to the libraries listed below
- Read and write WARC files in Go ☆47 Updated 7 years ago
- Golang WARC (Web ARChive) Library ☆30 Updated 5 years ago
- A golang library to work with WARC files from the common crawl ☆15 Updated 7 years ago
- Span formats. ☆17 Updated this week
- RAIS: A IIIF-compliant, 100% open source image server for blazing-fast deep zooming ☆78 Updated 3 months ago
- Go package to implement the IIIF Image API. ☆92 Updated last month
- OCFL implementation for Go ☆14 Updated this week
- A mini LDP Server written in Go. ☆11 Updated 8 years ago
- Serve millions of JSON documents via HTTP. ☆70 Updated 8 months ago
- Nifty library to manage, query and store RDF triples. Make RDF great again! ☆115 Updated 6 years ago
- ☆27 Updated 8 years ago
- Go package to parse GEDCOM files. ☆39 Updated this week
- A Reader/ReaderAt for Go that uses Range requests to get files over HTTP ☆28 Updated 2 years ago
- Object Resource Stream and CDXJ Drafts ☆14 Updated 6 years ago
- signature-based file format identification ☆238 Updated 3 months ago
- Go library and CLI for assisting in sending webmentions. ☆53 Updated last month
- Go library for parsing microformats ☆71 Updated last month
- Newshound: The Breaking News Email Aggregator ☆88 Updated 2 years ago
- mediawiki dump parser for loading up wikipedia data ☆106 Updated last month
- Miscellaneous tools for processing WARC files from the CommonCrawl ☆24 Updated 11 years ago
- language-agnostic syntax for documenting & outlining packages ☆9 Updated last year
- A JSON-LD processor for Go ☆115 Updated 6 years ago
- A Memento Aggregator CLI and Server in Go ☆65 Updated 4 months ago
- Linked Data server for Go ☆152 Updated 4 years ago
- (Note: This repository is obsolete, please see the new Browsertrix webrecorder/browsertrix) Browser-Based On-Demand Web Archiving Automat… ☆39 Updated 6 years ago
- Golang port of the boilerpipe Java library used for the removal of boilerplate and extraction of text content from HTML documents. ☆70 Updated 2 months ago
- Pure Go library for working with RDF, a powerful framework for representing informations as graphs. ☆37 Updated 8 years ago
- Docker image for the Archives Unleashed Toolkit ☆12 Updated 2 years ago
- Digital Preservation of HTTP in documentary heritage. ☆22 Updated 2 years ago
- Converts WARC files to static HTML ☆46 Updated last year