edgi-govdata-archiving / web-monitoring-processing
Tools for access, "diff"-ing, and analyzing archived web pages
☆20Updated last year
Related projects ⓘ
Alternatives and complementary repositories for web-monitoring-processing
- Specification for authentication and creating signed WACZ Files☆9Updated 2 years ago
- UI to enable analysts to quickly assess changes to monitored government websites☆37Updated last month
- Metadata and per-statute PDFs for the U.S. Statutes at Large through volume 64 (1789-1951).☆14Updated 4 years ago
- Trough: Big data, small databases.☆40Updated 3 months ago
- Library of Congress coding standards☆29Updated 5 months ago
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki☆25Updated 3 months ago
- A LevelDB backed URL unshortening microservice written in JavaScript☆31Updated last year
- Adding links to full text in Wikipedia references☆37Updated 10 months ago
- generic extraction recipes to get you started extracting schema.org entities for your software, data, and all things☆14Updated 5 years ago
- Ask questions about government data.☆37Updated 5 years ago
- Data Seal is a lightweight, UELMA-compliant data authentication service.☆32Updated 9 years ago
- A simple catalog of Twitter ID Datasets☆28Updated 2 months ago
- A prototype server to swarm multiple DATs for Webrecorder☆13Updated 5 years ago
- Backports for ckan.plugins.toolkit to ease CKAN extension compatibility☆15Updated 2 years ago
- Legislative data from the congress repository☆19Updated 11 years ago
- Materials to reproduce findings in our story, "Google’s Top Search Result? Surprise! It’s Google"☆34Updated 4 years ago
- A library for making web services that make functions available as synchronous or asynchronous jobs☆21Updated last year
- A listing of world wide web archives, for humans and machines using Web Archive Manifest (WAM) yaml format☆43Updated last year
- Web application to allow users to add content metadata about crawled resources☆13Updated 6 years ago
- Carles Pina Estany's 2020 Tool Fund: data managers and researchers collaborate to write the Frictionless Data packages, tabular schemas, …☆16Updated last year
- A Memento TimeGate☆40Updated 4 years ago
- Parser for U.S. federal regulations and other regulatory information☆38Updated last year
- Open source and open knowledge (data and content) licenses together with API and web service.☆65Updated 4 months ago
- An organization chart for the government of the United States.☆38Updated 10 years ago
- The Federal Election Commission's web-based application that makes regulations easier to find, read and understand.☆33Updated 6 months ago
- Organizing and publishing the web domains of the US federal government☆16Updated 6 years ago
- A service that provides archive-aware oEmbed-compatible embeddable surrogates (social cards, thumbnails, etc.) for archived web pages (me…☆15Updated 3 years ago
- Collecting reports from Inspectors General across the US federal government.☆107Updated 3 years ago
- Specification to describe the minimum information standard for online community data. Guidelines for describing data about online communi…☆11Updated 8 years ago
- a JavaScript plugin to warn users about links to private pages☆10Updated 2 years ago