datatogether / research
A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivity
☆95 Updated 6 years ago
Alternatives and similar repositories for research
Users that are interested in research are comparing it to the libraries listed below
- A list of things related to software, literature, and other content for Memento ☆98 Updated last year
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user ac… ☆54 Updated 3 months ago
- A Memento Aggregator CLI and Server in Go ☆64 Updated 2 months ago
- wabac.js - Web Archive Browsing Augmentation Client ☆108 Updated this week
- A search interface and wayback machine for the UKWA Solr based warc-indexer framework. ☆113 Updated 2 weeks ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC) ☆85 Updated last month
- WARC and ARC indexing and discovery tools. ☆124 Updated 2 months ago
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents) ☆160 Updated 4 years ago
- A dockerized, queued high fidelity web archiver based on Squidwarc ☆60 Updated 10 months ago
- Converts WARC files to static HTML ☆44 Updated 11 months ago
- WASAPI data transfer APIs ☆44 Updated 3 years ago
- Specifications developed and maintained by the Webrecorder community. ☆131 Updated 4 months ago
- Check out https://github.com/webrecorder/webrecorder for newer version matching https://webrecorder.io ☆38 Updated 9 years ago
- Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head ☆170 Updated 5 years ago
- CDXJ Indexing of WARC/ARCs ☆25 Updated 5 months ago
- Browsertrix: Containerized High-Fidelity Browser-Based Automated Crawling + Behavior System ☆87 Updated 4 years ago
- Comparing WARC files ☆17 Updated 6 years ago
- Trough: Big data, small databases. ☆42 Updated 10 months ago
- Webrecorder Automated In-Page Behavior Framework ☆13 Updated 4 years ago
- React components to render differences between captures at the Wayback Machine ☆34 Updated last month
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service ☆179 Updated 7 months ago
- Web archive index server based on RocksDB ☆34 Updated 3 weeks ago
- The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives. ☆144 Updated last year
- A listing of world wide web archives, for humans and machines using Web Archive Manifest (WAM) yaml format ☆54 Updated 2 years ago
- Command line tool for digging into WARC files ☆40 Updated 2 weeks ago
- A collection of tools for archiving and analysing the internet. ☆77 Updated 2 years ago
- Specification for authentication and creating signed WACZ Files ☆10 Updated 3 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system. ☆46 Updated 7 years ago
- A web browser that lets you save and organise the pages you visit. ☆43 Updated 5 years ago
- High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web. ☆160 Updated last month