edgi-govdata-archiving / web-monitoring
Documentation and project-wide issues for the Website Monitoring project (a.k.a. "Scanner")
β108Updated 2 months ago
Alternatives and similar repositories for web-monitoring
Users that are interested in web-monitoring are comparing it to the libraries listed below
Sorting:
- π A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivityβ95Updated 6 years ago
- track changes to the news, where news is anything with an RSS feedβ178Updated 4 years ago
- A simple catalog of Twitter ID Datasetsβ28Updated 5 months ago
- Tools for access, "diff"-ing, and analyzing archived web pagesβ20Updated 3 weeks ago
- Enhanced Social Tagging for Academic Communitiesβ95Updated 7 months ago
- A Memento Aggregator CLI and Server in Goβ64Updated 2 months ago
- β61Updated 5 years ago
- UI to enable analysts to quickly assess changes to monitored government websitesβ37Updated 3 weeks ago
- A suite of focused and simple tools and activities for journalists, data journalism classrooms and community advocacy groupsβ63Updated last year
- Internet Research Agency Facebook ads as structured dataβ22Updated 5 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.β46Updated 7 years ago
- Scrapers for US municipal governments.β101Updated 11 months ago
- Run Overview on your own systemβ124Updated 3 years ago
- Save My News: A personal, permanent clipping serviceβ27Updated last year
- Materials to reproduce findings in our story, "Googleβs Top Search Result? Surprise! Itβs Google"β34Updated 4 years ago
- Web Archives for Historical Researchβ13Updated 7 years ago
- JavaScript app for displaying annotated network graphs from the LittleSis API and other data sourcesβ39Updated 7 years ago
- A Twitter data collection and appraisal application.β51Updated 2 years ago
- Prototype SOLR-powered web archive exploration UI.β43Updated 4 years ago
- ReproZip for the Preservation of Web Applicationsβ17Updated last year
- A list of things related to software, literature, and other content for π£ Mementoβ97Updated 11 months ago
- Browsertrix: Containerized High-Fidelity Browser-Based Automated Crawling + Behavior Systemβ87Updated 4 years ago
- A command line utility for listing and searching snapshots in web archivesβ16Updated last year
- A python client for the DPLA APIβ43Updated 2 years ago
- β49Updated last year
- A simple OpenRefine reconciliation service that runs on top of a CSV fileβ120Updated 9 years ago
- WARC and ARC indexing and discovery tools.β123Updated 2 months ago
- A push-button Digital Humanities laboratory.β126Updated 6 years ago
- Digital Preservation of HTTP in documentary heritage.β22Updated last year
- Interactive and searchable House staffer directory, based on House disbursement data.β27Updated last year