Internet Archive's Sparkling Data Processing Library
☆15Feb 6, 2026Updated 3 weeks ago
Alternatives and similar repositories for Sparkling
Users that are interested in Sparkling are comparing it to the libraries listed below
Sorting:
- Web application for distributed compute analysis of Archive-It web archive collections.☆20Oct 9, 2025Updated 4 months ago
- ☆16Apr 19, 2025Updated 10 months ago
- Web Archiving Course☆23Mar 4, 2024Updated last year
- OCFL tools in Python☆25Aug 22, 2025Updated 6 months ago
- ☆26May 5, 2023Updated 2 years ago
- This project has been archived and is no longer being developed or supported. The Curator's Workbench is an extensible digital collectio…☆24Jun 25, 2020Updated 5 years ago
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.☆39Nov 24, 2025Updated 3 months ago
- Common web archive utility code.☆61Feb 6, 2026Updated 3 weeks ago
- --DEPRECATED--. Use other top level repository under IntellectualHeaven.☆42Jan 30, 2015Updated 11 years ago
- ☆12Nov 22, 2024Updated last year
- PAP/API Lite eller PAPILITE som det förkortas till, är ett oberoende och öppet REST API med alla postnummer och postorter för Sverige, Da…☆10Jul 10, 2022Updated 3 years ago
- RadiaSoft utilities for modeling linear accelerators, including the Hellweg code☆10Nov 29, 2025Updated 3 months ago
- Command line tool for digging into WARC files☆51Updated this week
- Vossian Antonomasia☆10Oct 17, 2025Updated 4 months ago
- Windows Dev Home Application☆17Jan 29, 2024Updated 2 years ago
- WSDM 2021 Tutorial on Advances in Bias-aware Recommendation on the Web☆11Mar 8, 2021Updated 4 years ago
- OpenPGP in Python using Sequoia PGP☆18Updated this week
- MS Marco Entity Annotations Disambiguation☆13May 19, 2023Updated 2 years ago
- [TPAMI-2018] A C++ framework for training/testing Support Vector Machine with Gaussian Sample Uncertainty (SVM-GSU).☆13Feb 20, 2018Updated 8 years ago
- 移动端UI自动化测试脚本,Appium + Cucumber测试模式,Ruby编写。https://www.jianshu.com/p/c3db8e5dc306☆11Jun 15, 2018Updated 7 years ago
- (Note: This repository is obsolete, please see the new Browsertrix webrecorder/browsertrix) Browser-Based On-Demand Web Archiving Automat…☆38Apr 23, 2019Updated 6 years ago
- Tools to analyze web archives☆20Jul 12, 2016Updated 9 years ago
- Scripts for scraping metadata from Academia.edu and migrating publications into Zenodo.org via its REST API☆12Jan 25, 2017Updated 9 years ago
- Automating description for Web Archives in ArchivesSpace using the Archive-It CDX and Partner Data APIs☆11Aug 10, 2018Updated 7 years ago
- functionality on top of an RDF store while accounting for and exploiting the fundamental differences between graph storage and relation…☆12Feb 21, 2024Updated 2 years ago
- An evil web server.☆13May 9, 2015Updated 10 years ago
- Fork of the ski ia64 emulator☆12May 15, 2016Updated 9 years ago
- A minter for opaque identifiers in the style of California Digital Library's NOID.☆13May 14, 2018Updated 7 years ago
- Yet Another SEquence Tagger☆10Dec 8, 2022Updated 3 years ago
- ☆10Oct 2, 2022Updated 3 years ago
- INWX Domrobot Ruby Client☆11Oct 17, 2025Updated 4 months ago
- Python script to create CDX index files of WARC data☆16Sep 7, 2018Updated 7 years ago
- Helm chart definitions for the Mongoose stack (MongooseIM, MongoosePush)☆14Feb 3, 2026Updated 3 weeks ago
- A collection of Islandora microservices, lovingly known as Crayfish.☆10Jul 28, 2025Updated 7 months ago
- vim plugin that support doing CRs and MRs in gitlab☆12May 30, 2020Updated 5 years ago
- EXPERIMENTAL PROTOTYPE code for "Bolt-on Causal Consistency" appearing in SIGMOD 2013☆12Nov 2, 2013Updated 12 years ago
- pip install patchelf. patchelf Python wheel for PyPI.☆11Feb 23, 2026Updated last week
- ☆10Apr 11, 2017Updated 8 years ago
- A framework for creating digital exhibits by loading collection metadata directly from a CSV (such as a published Google Sheet!). See the…☆14Feb 20, 2026Updated last week