richardlehane / webarchive
golang readers for ARC and WARC webarchive formats
☆20 Updated 2 years ago
Alternatives and similar repositories for webarchive
Users that are interested in webarchive are comparing it to the libraries listed below
- Read and write WARC files in Go ☆47 Updated 7 years ago
- A golang library to work with WARC files from the common crawl ☆15 Updated 7 years ago
- Golang WARC (Web ARChive) Library ☆30 Updated 6 years ago
- Span formats. ☆17 Updated 3 weeks ago
- ☆16 Updated 4 months ago
- A mini LDP Server written in Go. ☆11 Updated 8 years ago
- Serve millions of JSON documents via HTTP. ☆70 Updated 8 months ago
- Pure Go library for working with RDF, a powerful framework for representing information as graphs. ☆37 Updated 8 years ago
- Nifty library to manage, query and store RDF triples. Make RDF great again! ☆115 Updated 6 years ago
- ☆27 Updated 8 years ago
- RAIS: A IIIF-compliant, 100% open source image server for blazing-fast deep zooming ☆78 Updated 3 months ago
- A forwarding mail server inspired by @alum.mit.edu ☆19 Updated 9 years ago
- A general purpose application which can be used to host read-only access to one or more Bleve indexes ☆13 Updated 8 years ago
- Go package to implement the IIIF Image API. ☆93 Updated last month
- OCFL implementation for Go ☆14 Updated 3 weeks ago
- Miscellaneous tools for processing WARC files from the CommonCrawl ☆24 Updated 11 years ago
- Go package to parse GEDCOM files. ☆40 Updated last week
- A Memento Aggregator CLI and Server in Go ☆67 Updated 5 months ago
- Digital Preservation of HTTP in documentary heritage. ☆22 Updated 2 years ago
- Command line tool for digging into WARC files ☆44 Updated 2 weeks ago
- Object Resource Stream and CDXJ Drafts ☆14 Updated 6 years ago
- Go sqlite VFS for using a zstd seekable compressed file. ☆51 Updated this week
- A JSON-LD processor for Go ☆115 Updated 6 years ago
- Docker image for the Archives Unleashed Toolkit ☆12 Updated 2 years ago
- Your personal database. Mirror of https://git.sr.ht/~tsileo/blobstash ☆104 Updated 5 years ago
- Golang port of the boilerpipe Java library used for the removal of boilerplate and extraction of text content from HTML documents. ☆70 Updated 3 months ago
- A Reader/ReaderAt for Go that uses Range requests to get files over HTTP ☆28 Updated 2 years ago
- signature-based file format identification ☆243 Updated 3 months ago
- A simple, lightweight, embedded geocoder for Golang with city level accuracy ☆73 Updated 9 years ago
- mediawiki dump parser for loading up wikipedia data ☆106 Updated 2 months ago