helgeho / ArchiveSpark
An Apache Spark framework for easy data processing, extraction, and derivation for web archives and archival collections, developed at the Internet Archive.
158 · Oct 8, 2025 · Updated 5 months ago

Alternatives and similar repositories for ArchiveSpark

Users that are interested in ArchiveSpark are comparing it to the libraries listed below.
