PIReTship / bookdata-tools
Tools for working with book data
☆18Updated 2 months ago
Alternatives and similar repositories for bookdata-tools:
Users that are interested in bookdata-tools are comparing it to the libraries listed below
- Fast, permanent and flexible patterns for sharing and computing on texts with metadata using Apache Arrow.☆14Updated 2 years ago
- All that entity matching, resolution, normalization, enhancement and reconciliation madness, but with a focus on data, not platforms.☆24Updated 2 years ago
- Python API for KB data-services☆18Updated 5 years ago
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.☆23Updated 11 months ago
- A collection of ipython/jupyter notebooks☆16Updated 6 years ago
- A proposed standard `NOCK` for a Parquet format that supports efficient distributed serialization of multiple kinds of graph technologies☆19Updated 2 years ago
- ☆12Updated 2 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆24Updated 2 years ago
- Named Entity Disambiguation and Linking☆15Updated 8 months ago
- Alignment, a collaborative, system aided, user driven ontology/vocabulary matching and validation platform.☆12Updated 2 years ago
- Topic Modeling Workflow in Python☆16Updated last year
- A set of workflows for corpus building through OCR, post-correction and normalisation☆48Updated 2 years ago
- A deep learning architecture for reference mining from literature in the arts and humanities.☆15Updated 5 years ago
- CSV on the web☆38Updated 3 months ago
- OpenRefine reconciler for Research Organization Registry☆13Updated 2 years ago
- Python package for harvesting records from OAI-PMH provider(s).☆62Updated 2 years ago
- A workflow system for Natural Language Processing.☆21Updated 5 years ago
- Bagit-based data packaging specification for dissemination of research data with useful human and machine readable metadata: "Make Data C…☆38Updated 5 years ago
- nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset☆17Updated 3 months ago
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Updated 2 years ago
- Loadable spellfix1 extension for sqlite as python package☆25Updated 9 months ago
- NLP pipeline software using common workflow language☆34Updated 5 years ago
- A structured list of text corpora, created for use with a corpus downloader.☆13Updated 8 years ago
- Free-form web data notebook - "Data management for little guys"☆26Updated last year
- Automatically exported from code.google.com/p/tdwg-rdf☆21Updated 5 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆38Updated 2 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- Citation Classification using hybrid neural network model for Wikipedia References☆28Updated 2 years ago
- Execute OpenRefine JSON scripts without OpenRefine (or Java)☆29Updated 2 years ago
- Lakesuperior, an alternative Fedora Repository implementation☆32Updated 2 years ago