bmschmidt / nonconsumptive
Fast, permanent and flexible patterns for sharing and computing on texts with metadata using Apache Arrow.
☆14Updated 3 years ago
Alternatives and similar repositories for nonconsumptive:
Users that are interested in nonconsumptive are comparing it to the libraries listed below
- A Mashup Interface for Text Analysis Operations☆13Updated 4 months ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Updated 2 years ago
- Simple command line oai-pmh harvester written in Python.☆41Updated 2 years ago
- Tools for working with HTRC Feature Extraction files☆39Updated 3 months ago
- nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset☆17Updated 6 months ago
- VIAF via Python☆12Updated last year
- Download and manipulate HathiTrust wordcount data in the tidyverse☆9Updated 3 years ago
- ☆28Updated 4 years ago
- Interactive TOpic Model and MEtadata Visualization. Live at: tome.lmc.gatech.edu☆13Updated 5 years ago
- Text Re-use Alignment Visualization☆38Updated 7 years ago
- Rails application with Blazegraph for managing controlled vocabularies in RDF.☆22Updated last year
- Awesome AI in Libraries☆16Updated last year
- A deep learning architecture for reference mining from literature in the arts and humanities.☆16Updated 5 years ago
- A study group for v4 of the fastai introduction to deep learning course with a focus on applications in GLAM settings☆15Updated 3 years ago
- How About Machine Learning Enhancing Theses? - a pilot discovery project☆14Updated last year
- Code repository for whatisdigitalhumanities.com☆32Updated 2 years ago
- Project DAHN "Digital Edition of historical manuscripts (correspondences)"☆15Updated 5 months ago
- Scripts that clean up OCR and munge Hathi metadata.☆76Updated 7 years ago
- Extract, transform, and analyze bibliographic data from Wikidata dumps☆27Updated 2 years ago
- Workshop materials for our DH2018 workshop on word vectors. Created by Eun Seo Jo, Javier de la Rosa, and Scott Bailey☆15Updated 6 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆26Updated 3 years ago
- Citation Classification using hybrid neural network model for Wikipedia References☆28Updated 2 years ago
- ☆12Updated last year
- Named entity annotation tool☆27Updated last year
- Softcite software mention recognizer, finding mentions and citations to software from within the academic literature☆77Updated 2 weeks ago
- Scripts to create git repositories for ALTO XML texts, like those from the British Library's scanned documents.☆31Updated 7 years ago
- This is a public repository for sharing, improving, and versioning "The Topic Modeling Game," a lesson developed by Lisa Rhody to teach t…☆10Updated 6 years ago
- Project on the history of genre.☆22Updated 5 years ago
- Heritage Connector: Transforming text into data to extract meaning and make connections☆24Updated 2 years ago
- Bagit-based data packaging specification for dissemination of research data with useful human and machine readable metadata: "Make Data C…☆38Updated 5 years ago