📚 A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivity
☆99Sep 27, 2018Updated 7 years ago
Alternatives and similar repositories for research
Users that are interested in research are comparing it to the libraries listed below
Sorting:
- Parse OCR result files for pagenos, tables of contents, etc.☆14Nov 30, 2011Updated 14 years ago
- A ServiceWorker for client-side reconstruction of composite mementos☆16Mar 6, 2025Updated 11 months ago
- Web application to allow users to add content metadata about crawled resources☆13Feb 15, 2018Updated 8 years ago
- linkbak is a web page archiver : it reads a list of links and dumps the corresponding pages in HTML and PDF.☆13Dec 8, 2022Updated 3 years ago
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.☆12Oct 5, 2024Updated last year
- Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in t…☆132Nov 21, 2025Updated 3 months ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC)☆92Apr 22, 2025Updated 10 months ago
- An Awesome List for getting started with web archiving☆2,481Jan 19, 2026Updated last month
- Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head☆173May 19, 2020Updated 5 years ago
- Web Archiving Integration Layer: One-Click User Instigated Preservation☆391Mar 12, 2025Updated 11 months ago
- Tools for access, "diff"-ing, and analyzing archived web pages☆21Updated this week
- The OpenWayback Development☆516Jan 3, 2024Updated 2 years ago
- A Memento Aggregator CLI and Server in Go☆77Mar 4, 2025Updated 11 months ago
- Simplifying the process of launching an open data repository. [RETIRED]☆20Jan 7, 2015Updated 11 years ago
- A prototype server to swarm multiple DATs for Webrecorder☆14Apr 27, 2019Updated 6 years ago
- Browser extension to easily save code snippets from the web to Codever☆12Sep 1, 2021Updated 4 years ago
- Update a local archive of your tweets.☆49Oct 12, 2012Updated 13 years ago
- brozzler - distributed browser-based web crawler☆788Updated this week
- This software (prototype) extracts values of Excel spreadsheet properties and calculates a tentative spreadsheet complexity assessment ba…☆13Dec 13, 2022Updated 3 years ago
- Start here! Discussion for Data Together: Building a better future for data☆46Oct 21, 2019Updated 6 years ago
- A command line tool that queries the Open Corporates Database and returns data on corporations under the copyleft Open Database License.☆33Jan 22, 2023Updated 3 years ago
- A collection of awesome web scaper, crawler.☆283Apr 4, 2024Updated last year
- This repository houses materials related to the NDSRNY 2016 Symposium, Let's Get Digital☆11May 18, 2016Updated 9 years ago
- Example osquery configuration for Linux servers using eBPF for events☆16Aug 27, 2021Updated 4 years ago
- A dockerized, queued high fidelity web archiver based on Squidwarc☆62Jul 9, 2024Updated last year
- Core Python Web Archiving Toolkit for replay and recording of web archives☆1,627Jan 21, 2026Updated last month
- DEPRECATED see coboxcoop/peerfs ~~ multiwriter peer-to-peer filesystem, built on kappa-core and hyperdrive☆15Aug 10, 2019Updated 6 years ago
- A tool for Mastodon users to protect themselves against harassment☆15Nov 18, 2022Updated 3 years ago
- A simple, lightweight and free notes tool☆17Aug 10, 2023Updated 2 years ago
- Microdata schema for historical data.☆31Jun 12, 2012Updated 13 years ago
- How to use Scrivener to write HTMLBook☆17Jun 15, 2021Updated 4 years ago
- simple dat downloading module☆10Jan 11, 2023Updated 3 years ago
- UI to enable analysts to quickly assess changes to monitored government websites☆37Feb 15, 2026Updated 2 weeks ago
- simple script to convert web resources to a single warc file☆22May 11, 2023Updated 2 years ago
- A search engine built on the Unpaywall database☆20Mar 13, 2024Updated last year
- qri dataset definition☆15Sep 24, 2021Updated 4 years ago
- high level thoughts and issues for the future of cabal☆14Jan 9, 2024Updated 2 years ago
- Legal Code for the State of Utah☆44Apr 8, 2014Updated 11 years ago
- Interact with ArchiveBox to automatically archive all your saved reddit posts and comments.☆19Nov 26, 2022Updated 3 years ago