oduwsdl / CarbonDateLinks
Estimating the age of web resources
β96Updated last month
Alternatives and similar repositories for CarbonDate
Users that are interested in CarbonDate are comparing it to the libraries listed below
Sorting:
- A list of things related to software, literature, and other content for π£ Mementoβ99Updated last year
- A simple Python wrapper and command-line interface for archive.orgβs "Save Page Now" capturing serviceβ180Updated 9 months ago
- A Memento Aggregator CLI and Server in Goβ67Updated 5 months ago
- Social Feed Manager user interface application.β156Updated last year
- A commandline tool and Python library for archiving data from Facebook using the Graph API.β78Updated 7 years ago
- Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in tβ¦β127Updated last week
- π A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivityβ96Updated 6 years ago
- track changes to the news, where news is anything with an RSS feedβ178Updated 5 years ago
- A tool to detect whether a PDF has a bad redactionβ145Updated 3 weeks ago
- Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a headβ170Updated 5 years ago
- π¨ High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.β166Updated last month
- Grabbing all news.β62Updated 5 years ago
- Browsertrix: Containerized High-Fidelity Browser-Based Automated Crawling + Behavior Systemβ87Updated 4 years ago
- A collection of tools for archiving and analysing the internet.β77Updated 3 years ago
- An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developedβ¦β152Updated this week
- The Internet Archive Research Assistant - Daily search Internet Archive for new items matching your keywordsβ75Updated 2 months ago
- A toolkit for CDX indices such as Common Crawl and the Internet Archive's Wayback Machineβ179Updated 7 months ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC)β86Updated 3 months ago
- Converts WARC files to static HTMLβ46Updated last year
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)β163Updated 3 weeks ago
- Web archive index server based on RocksDBβ34Updated 3 weeks ago
- DEPRECATED. Desktop graph visualization applicationβ51Updated 2 years ago
- Chrome extension that uses Memento to indicate that a page a user is viewing on the live web has an archived copy and to give the user acβ¦β54Updated last month
- A helper library full of URL-related heuristics.β70Updated last month
- A LevelDB backed URL unshortening microservice written in JavaScriptβ31Updated 2 years ago
- Run Overview on your own systemβ125Updated 4 years ago
- A search interface and wayback machine for the UKWA Solr based warc-indexer framework.β118Updated last week
- Centralised repository for WARC usage specifications.β115Updated 8 months ago
- Bot for operating snscrape in #archivebot on efnetβ11Updated 5 years ago
- Frontend component for Hoaxy, a tool to visualize the spread of claims and fact checkingβ72Updated 2 years ago