inertia-lab / bookdata-toolsLinks
Tools for working with book data
☆18Updated last month
Alternatives and similar repositories for bookdata-tools
Users that are interested in bookdata-tools are comparing it to the libraries listed below
Sorting:
- Documents for the project Libraccess☆13Updated 10 years ago
- Python API for KB data-services☆19Updated 5 years ago
- Python package to reconcile DataFrames☆24Updated 2 years ago
- Process, enhance and evaluate multiple OCR output.☆24Updated last year
- Docker image for the Archives Unleashed Toolkit☆12Updated 3 years ago
- Adds a reconciliation API endpoint to Datasette, based on the Reconciliation Service API specification.☆24Updated last year
- Python tools for performing various operations on ALTO XML files☆48Updated 9 months ago
- Citation Classification using hybrid neural network model for Wikipedia References☆31Updated 2 years ago
- Web application for transcribing OCR ground truth from Archive.org☆17Updated 7 years ago
- Lakesuperior, an alternative Fedora Repository implementation☆32Updated 3 years ago
- Python package for harvesting records from OAI-PMH provider(s).☆64Updated 3 years ago
- A collection of ipython/jupyter notebooks☆16Updated 6 years ago
- Convert between Tesseract hOCR and ALTO XML using XSL stylesheets☆58Updated 2 months ago
- Convert SPARQL results to a pandas dataframe☆28Updated last year
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Updated 3 years ago
- 🗣 Multilingual RDF Verbalizer – Google Summer of Code 2019☆21Updated 2 years ago
- Example SPARQL queries, mostly for working with ZBW data sets☆16Updated last month
- Mario is a metadata processing pipeline that will process data from various sources and write to Elasticsearch☆13Updated 2 years ago
- VIAF via Python☆12Updated 5 months ago
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents.☆31Updated 8 years ago
- This repository has migrated to:☆100Updated last month
- Web application to try out reconciliation services interactively☆13Updated last week
- Text-Induced Corpus Clean-up☆20Updated 2 years ago
- Java based viewer for PAGE XML files (layout + text content). Also supports ALTO XML, FineReader XML, and HOCR.☆35Updated 2 years ago
- Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup☆70Updated 5 months ago
- Repository for the book Among Digitized Manuscripts by L.W. Cornelis van Lit (Leiden: Brill, 2020)☆23Updated 5 years ago
- A fast, responsive HTML5 viewer for scanned items, developed for the World Digital Library. A project of the Library of Congress. Note: p…☆22Updated 10 years ago
- Named entity annotation tool☆28Updated 2 years ago
- Knowledge graph construction: Fast inserts into a Wikibase instance☆46Updated 3 years ago
- OCFL implementation for Go☆15Updated 2 weeks ago