edgi-govdata-archiving / web-monitoring
Documentation and project-wide issues for the Website Monitoring project (a.k.a. "Scanner")
β108Updated last month
Alternatives and similar repositories for web-monitoring:
Users that are interested in web-monitoring are comparing it to the libraries listed below
- π A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivityβ95Updated 6 years ago
- A Memento Aggregator CLI and Server in Goβ62Updated 3 weeks ago
- track changes to the news, where news is anything with an RSS feedβ178Updated 4 years ago
- A simple catalog of Twitter ID Datasetsβ28Updated 4 months ago
- Tools for access, "diff"-ing, and analyzing archived web pagesβ20Updated this week
- Enhanced Social Tagging for Academic Communitiesβ95Updated 5 months ago
- ReproZip for the Preservation of Web Applicationsβ17Updated 10 months ago
- A place to collect and share knowledge about liberating data from PDFsβ54Updated 3 years ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC)β85Updated 2 weeks ago
- "Old SFM" -- manage rules and streams from social data sources, starting with twitter.β86Updated last year
- wabac.js - Web Archive Browsing Augmentation Clientβ106Updated this week
- Internet Research Agency Facebook ads as structured dataβ22Updated 5 years ago
- WARC and ARC indexing and discovery tools.β122Updated 3 weeks ago
- WASAPI data transfer APIsβ44Updated 2 years ago
- Social Feed Manager user interface application.β155Updated 9 months ago
- Digital Preservation of HTTP in documentary heritage.β22Updated last year
- Web Archives for Historical Researchβ13Updated 7 years ago
- A LevelDB backed URL unshortening microservice written in JavaScriptβ31Updated 2 years ago
- JavaScript app for displaying annotated network graphs from the LittleSis API and other data sourcesβ39Updated 7 years ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.β39Updated this week
- MuckRock's source code - Please report bugs, issues and feature requests to info@muckrock.comβ113Updated this week
- Tools for tracking stories on news homepagesβ48Updated 5 years ago
- Materials to reproduce findings in our story, "Googleβs Top Search Result? Surprise! Itβs Google"β34Updated 4 years ago
- A list of things related to software, literature, and other content for π£ Mementoβ96Updated 10 months ago
- Run Overview on your own systemβ123Updated 3 years ago
- Library of Congress coding standardsβ30Updated 9 months ago
- Adding links to full text in Wikipedia referencesβ37Updated last year
- A suite of focused and simple tools and activities for journalists, data journalism classrooms and community advocacy groupsβ62Updated 11 months ago
- A Twitter data collection and appraisal application.β51Updated 2 years ago
- Grabbing all news.β62Updated 5 years ago