edgi-govdata-archiving / web-monitoring-processing
Tools for access, "diff"-ing, and analyzing archived web pages
☆20Updated 3 weeks ago
Alternatives and similar repositories for web-monitoring-processing
Users that are interested in web-monitoring-processing are comparing it to the libraries listed below
Sorting:
- Documentation and project-wide issues for the Website Monitoring project (a.k.a. "Scanner")☆108Updated 2 months ago
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki☆26Updated 9 months ago
- A card game that helps organizations and communities explore governance around a shared codebase, whether hypothetical or in a real-world…☆41Updated last month
- Web application to allow users to add content metadata about crawled resources☆13Updated 7 years ago
- UI to enable analysts to quickly assess changes to monitored government websites☆37Updated 3 weeks ago
- 📚 Chrome extension to nominate government data that needs to be preserved☆20Updated 4 years ago
- 🛠️ A library for mapping CKAN metadata <=> Frictionless metadata☆9Updated 2 years ago
- Specification for authentication and creating signed WACZ Files☆10Updated 3 years ago
- Carles Pina Estany's 2020 Tool Fund: data managers and researchers collaborate to write the Frictionless Data packages, tabular schemas, …☆17Updated 2 years ago
- Adding links to full text in Wikipedia references☆37Updated last year
- A simple catalog of Twitter ID Datasets☆28Updated 5 months ago
- 📚 Monthly reading group for Data Together☆42Updated 4 years ago
- The DDI Discovery Vocabulary, an RDF vocabulary for data description and discovery based on DDI☆25Updated 2 years ago
- The One True Open Access Button - cross-compatible extension for research papers and data.☆45Updated 7 months ago
- Save My News: A personal, permanent clipping service☆27Updated last year
- Codemeta paper.☆10Updated 7 years ago
- The main repository of the Frictionless Data project. Website, issues, and discussions☆141Updated 2 weeks ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google"☆34Updated 4 years ago
- Specification to describe the minimum information standard for online community data. Guidelines for describing data about online communi…☆11Updated 8 years ago
- Legislative Branch Innovation Hub☆46Updated 3 weeks ago
- 'Git for Tabular Data'☆46Updated 8 years ago
- Backports for ckan.plugins.toolkit to ease CKAN extension compatibility☆17Updated 3 years ago
- The CFPB's official Source Code Policy.☆37Updated 3 years ago
- A LevelDB backed URL unshortening microservice written in JavaScript☆31Updated 2 years ago
- [DEPRECATED] Please use - https://github.com/frictionlessdata/frictionless-py☆13Updated 4 years ago
- The corporate repository where we discuss our serious business☆22Updated 2 months ago
- Library of Congress coding standards☆30Updated 11 months ago
- Information around TimBL's 5 star Open Data plan☆73Updated 5 months ago
- This is a basic instance of the D-Net software toolkit, a software framework for the realization of aggregative data infrastructures.☆15Updated 3 years ago
- Rig for deploying DocumentCloud viewers to S3.☆13Updated 3 years ago