helgeho / ArchiveSpark

An Apache Spark framework for easy data processing, extraction, and derivation for web archives and archival collections, developed at the Internet Archive.
150 · Updated 3 weeks ago

Alternatives and similar repositories for ArchiveSpark

Users who are interested in ArchiveSpark compare it to the libraries listed below.
