miku / siskin
Tasks around metadata.
☆21Updated last month
Alternatives and similar repositories for siskin
Users that are interested in siskin are comparing it to the libraries listed below
Sorting:
- Python library and command line tool for converting data from one format to another☆99Updated 4 years ago
- A markdown wiki and dashboarding system for Datasette☆21Updated 3 years ago
- Markdown for Linked Data☆16Updated 10 years ago
- This a module to extract RDF from an HTML5 page annotated with microdata. The module implements the algorithm defined and published by th…☆44Updated 2 years ago
- BatchRefine adds batch processing capabilities to OpenRefine☆50Updated 8 years ago
- Utils around luigi.☆66Updated 4 years ago
- Free-form web data notebook - "Data management for little guys"☆26Updated last month
- Execute OpenRefine JSON scripts without OpenRefine (or Java)☆30Updated 2 years ago
- Mirror of https://gerrit.wikimedia.org/g/purtle See https://www.mediawiki.org/wiki/Developer_access for contributing)☆11Updated 2 weeks ago
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Updated 3 years ago
- Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.☆46Updated 7 years ago
- The NYU Data Catalog facilitates researchers’ access to large datasets available either publicly or through institutional or individual l…☆29Updated last year
- Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+☆48Updated 2 years ago
- A framework to allow the matching of string entities using customised sets of transformations and matchers, plus a tool to produce the ne…☆31Updated 8 years ago
- Plots various graphs for a series of plaintext files in a directory☆19Updated 8 years ago
- Process, enhance and evaluate multiple OCR output.☆22Updated 6 months ago
- Python language parser for a tabular format for structured metadata. http://metatab.org☆18Updated last year
- Check out https://github.com/webrecorder/webrecorder for newer version matching https://webrecorder.io☆38Updated 9 years ago
- Linked Open Vocabularies (LOV) - scripts☆9Updated 8 years ago
- A fast, responsive HTML5 viewer for scanned items, developed for the World Digital Library. A project of the Library of Congress. Note: p…☆22Updated 10 years ago
- Prototype SOLR-powered web archive exploration UI.☆43Updated 4 years ago
- Datasette plugin for visualizing data using Vega☆58Updated last year
- Trough: Big data, small databases.☆41Updated 9 months ago
- Datasette plugin for executing SQL queries from templates☆10Updated 4 years ago
- OpenRefine is a free, open source power tool for working with messy data and improving it. This repository contains Dockerbuild files fo…☆21Updated 3 years ago
- agate-sql adds SQL read/write support to agate.☆18Updated 2 months ago
- neonion is a user-centered collaborative semantic annotation webapp developed at the Human-Centered Computing group at Freie Universität …☆68Updated 6 years ago
- This is the ETL lib package. It provides an API to munge and prepare JSON, TSV and other data using Apache Tika and JSON parsing/loading …☆17Updated last year
- produce a stream of citiation data coming off wikimedia☆12Updated 8 years ago
- Add editing UI and other power-user features to Datasette.☆12Updated 2 years ago