oduwsdl / CarbonDate
Estimating the age of web resources
β92Updated 7 months ago
Related projects: β
- A list of things related to software, literature, and other content for π£ Mementoβ85Updated 3 months ago
- A Memento Aggregator CLI and Server in Goβ55Updated 4 months ago
- A simple Python wrapper and command-line interface for archive.orgβs "Save Page Now" capturing serviceβ167Updated 2 weeks ago
- β62Updated 4 years ago
- DEPRECATED. Desktop graph visualization applicationβ50Updated last year
- BotSlayer Community Editionβ35Updated last year
- wabac.js - Web Archive Browsing Augmentation Clientβ97Updated this week
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user acβ¦β49Updated 2 months ago
- π¨ High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.β111Updated this week
- A search interface and wayback machine for the UKWA Solr based warc-indexer framework.β100Updated last month
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)β149Updated 4 years ago
- Archive a reddit user's post history. Formatted overview of a profile, JSON containing every post, and picture downloads. Uses the pushsβ¦β50Updated 2 years ago
- Codec is a collaborative tool for managing video evidence.β58Updated 5 months ago
- The Internet Archive Research Assistant - Daily search Internet Archive for new items matching your keywordsβ72Updated 5 months ago
- A tiny client side tool that retrieves the timestamp from Tiktok videos.β44Updated last year
- Browsertrix: Containerized High-Fidelity Browser-Based Automated Crawling + Behavior Systemβ88Updated 3 years ago
- Script and sample dataset of all urban dictionary entry names (around 1.4 million total)β81Updated 2 years ago
- A minimum-dependency ECMAScript client library and CLI tool for Parler β a "free speech" social network that accepts real money to buy "iβ¦β68Updated 3 months ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC)β81Updated last month
- WARC and ARC indexing and discovery tools.β114Updated last month
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.β13Updated last year
- community siteβ16Updated 5 years ago
- Various examples of notebooks for working with web archives with the Archives Unleashed Toolkit, and derivatives generated by the Archiveβ¦β22Updated last year
- COVID-19 Malicious Domain Research Dataβ16Updated 4 years ago
- Grabbing all news.β62Updated 4 years ago
- This windows CLI app lets you collect data from twitter via REST API and convert it into a CSV data set that can be used with Gephi. Otheβ¦β25Updated 3 years ago
- scraper for facebook, gab, google and tiktokβ22Updated 2 months ago
- A tool to detect whether a PDF has a bad redactionβ122Updated 2 months ago
- π A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivityβ91Updated 5 years ago
- Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a headβ166Updated 4 years ago