vinaygoel / archive-analysis
Tools to analyze web archives
☆20 · Updated 8 years ago
Alternatives and similar repositories for archive-analysis
Users who are interested in archive-analysis are comparing it to the libraries listed below:
- Using social media to steer web archiving and curation. ☆15 · Updated 9 years ago
- Fcrepo4 webapp plus optional fcrepo dependencies ☆13 · Updated 4 years ago
- Docker image for the Archives Unleashed Toolkit ☆12 · Updated 2 years ago
- A Rails engine supporting the discovery of web archives. ☆50 · Updated last year
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms. ☆24 · Updated 3 years ago
- Rails application to support the Sloan Dash grant project for self-deposit submission of scholarly works. ☆16 · Updated 5 years ago
- WASAPI data transfer APIs ☆43 · Updated 2 years ago
- This repository contains tool and collections dataset for detecting off-topic pages from Web archived collections. ☆18 · Updated 9 years ago
- The bibfra.me vocabulary ☆10 · Updated 2 years ago
- A collection of ipython/jupyter notebooks ☆16 · Updated 6 years ago
- This repo holds the source code for the web application ☆15 · Updated last year
- Adds the ability to transcribe items using the Scripto library. ☆17 · Updated 6 months ago
- The "VIVO-ISF Ontology" is an OWL2 representation of the VIVO-ISF Data Standard ☆16 · Updated 5 years ago
- Rails application for the Archives Unleashed Cloud. ☆11 · Updated 3 years ago
- Markdown for Linked Data ☆16 · Updated 9 years ago
- ☆16 · Updated 9 years ago
- Web application for distributed compute analysis of Archive-It web archive collections. ☆15 · Updated 5 months ago
- Topic Modeling Workflow in Python ☆16 · Updated 2 years ago
- A Data Parsing/Data Manipulation Tool Supporting Digitization Projects and Other Data Analysis Projects ☆47 · Updated 5 years ago
- Adding links to full text in Wikipedia references ☆37 · Updated last year
- A comprehensive graph of mathematical domains and topics ☆20 · Updated 3 years ago
- Web Tables Automatic Property Mapping ☆7 · Updated 5 years ago
- code to remove "noise" from hOCR output of Tesseract OCR. ☆14 · Updated 8 years ago
- Prototype SOLR-powered web archive exploration UI. ☆43 · Updated 4 years ago
- A project to coordinate implementing a system to signal whether references cited on Wikipedia are free to reuse ☆19 · Updated 8 years ago
- Python API for KB data-services ☆19 · Updated 5 years ago
- Trough: Big data, small databases. ☆40 · Updated 6 months ago
- Free-form web data notebook - "Data management for little guys" ☆26 · Updated 2 years ago
- The Web Curator Tool is a tool for managing the selective web harvesting process. (moved from SourceForge). https://webcurator.slack.com … ☆27 · Updated 2 years ago
- an RDF datastore that gives researchers control over the sharing of data between datasets ☆41 · Updated 9 months ago