📚 A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivity
☆99 · Sep 27, 2018 · Updated 7 years ago
Alternatives and similar repositories for research
Users who are interested in research are comparing it to the libraries listed below.
- Convert Directories, Files and ZIP Files to Web Archives (WARC) ☆98 · Apr 22, 2025 · Updated last year
- Web application to allow users to add content metadata about crawled resources ☆13 · Feb 15, 2018 · Updated 8 years ago
- Tools for access, "diff"-ing, and analyzing archived web pages ☆23 · Updated this week
- An Awesome List for getting started with web archiving ☆2,547 · Apr 27, 2026 · Updated 3 weeks ago
- Update a local archive of your tweets. ☆49 · Oct 12, 2012 · Updated 13 years ago
- A ServiceWorker for client-side reconstruction of composite mementos ☆15 · Mar 6, 2025 · Updated last year
- linkbak is a web page archiver : it reads a list of links and dumps the corresponding pages in HTML and PDF. ☆13 · Dec 8, 2022 · Updated 3 years ago
- Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head ☆174 · May 19, 2020 · Updated 6 years ago
- Official Python package for ArchiveBox, the self-hosted internet archiving solution. ☆12 · Oct 5, 2024 · Updated last year
- Web Archiving Integration Layer: One-Click User Instigated Preservation ☆395 · Apr 23, 2026 · Updated 3 weeks ago
- The OpenWayback Development ☆519 · Jan 3, 2024 · Updated 2 years ago
- Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in t… ☆132 · Nov 21, 2025 · Updated 6 months ago
- A prototype server to swarm multiple DATs for Webrecorder ☆14 · Apr 27, 2019 · Updated 7 years ago
- Parse OCR result files for pagenos, tables of contents, etc. ☆14 · Nov 30, 2011 · Updated 14 years ago
- brozzler - distributed browser-based web crawler ☆796 · May 12, 2026 · Updated last week
- Host, backup, and share hyperdrive archives ☆13 · Aug 22, 2017 · Updated 8 years ago
- Protobuf/gRPC schemas for the Hyperdrive API ☆14 · Jul 14, 2020 · Updated 5 years ago
- Hacking challenges to learn web archive security. ☆35 · Jun 23, 2017 · Updated 8 years ago
- An HTTP API for tracking and annotating changes to a set of web pages. ☆22 · May 11, 2026 · Updated last week
- A tool to scrape the ipfs network for information on the number of peers in the network. ☆21 · Mar 22, 2024 · Updated 2 years ago
- utility to fetch provenance information from Internet Archive's Wayback Machine ☆15 · Feb 5, 2026 · Updated 3 months ago
- qri electron & web frontend ☆23 · Aug 10, 2021 · Updated 4 years ago
- The public display of the homosaurus vocabulary. ☆12 · Apr 22, 2026 · Updated last month
- A list of things related to software, literature, and other content for 🕣 Memento ☆114 · Feb 4, 2026 · Updated 3 months ago
- This software (prototype) extracts values of Excel spreadsheet properties and calculates a tentative spreadsheet complexity assessment ba… ☆13 · May 15, 2026 · Updated last week
- simple dat downloading module ☆10 · Jan 11, 2023 · Updated 3 years ago
- Uploads items into the Internet Archive after they have been downloaded with youtube-dl ☆15 · Feb 28, 2015 · Updated 11 years ago
- Start here! Discussion for Data Together: Building a better future for data ☆46 · Oct 21, 2019 · Updated 6 years ago
- Parallelized web crawler written in Golang ☆15 · Oct 2, 2018 · Updated 7 years ago
- Basic python script to list following and followed blogs on Tumblr ☆20 · Oct 16, 2014 · Updated 11 years ago
- Documentation for Project Electron ☆14 · Dec 2, 2024 · Updated last year
- ☆11 · Dec 15, 2025 · Updated 5 months ago
- DEPRECATED see coboxcoop/peerfs ~~ multiwriter peer-to-peer filesystem, built on kappa-core and hyperdrive ☆15 · Aug 10, 2019 · Updated 6 years ago
- Reconcile artist names to the Getty Union List of Artist Names ☆19 · Oct 10, 2016 · Updated 9 years ago
- push a dat to remote peers ☆22 · May 11, 2020 · Updated 6 years ago
- Webrecorder Player for Desktop (OSX/Windows/Linux). (Built with Electron + Webrecorder) ☆446 · Sep 17, 2020 · Updated 5 years ago
- Dat's way of encoding and decoding dat links [ DEPRECATED - see https://github.com/mafintosh/abstract-encoding and https://github.com/com… ☆18 · Jan 6, 2022 · Updated 4 years ago
- qri dataset definition ☆15 · Sep 24, 2021 · Updated 4 years ago
- Netarchivesuite development ☆23 · May 15, 2026 · Updated last week