dasmiq / passim
Detect and align similar passages
☆100Updated 2 months ago
Alternatives and similar repositories for passim:
Users that are interested in passim are comparing it to the libraries listed below
- You Actually Look Twice At it☆33Updated 3 months ago
- Digital Humanities Across Borders☆47Updated last year
- ☆28Updated 4 years ago
- High-performance text aligner for large collections of texts☆51Updated last week
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year
- A hands-on activity in linking and enriching geo-data, part of the Linked Pasts conference☆14Updated 4 years ago
- Text collections made available by the CLiGS group.☆23Updated 3 years ago
- Repository for the book Among Digitized Manuscripts by L.W. Cornelis van Lit (Leiden: Brill, 2020)☆22Updated 5 years ago
- Early Novels Database dataset☆16Updated 6 years ago
- Named entity annotation tool☆27Updated last year
- ☆33Updated 10 months ago
- Python implementation of the Zeta score for contrastive text analysis☆14Updated 3 years ago
- This research seeks to examine best practice in the field of digital editions by collating relevant evidence in a detailed catalogue of e…☆53Updated 3 weeks ago
- Tools for working with HTRC Feature Extraction files☆39Updated 3 months ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 4 years ago
- A DH abstracts conversion tool☆11Updated last month
- Netherlands eScience Center - Shifting Concepts Through Time project☆26Updated 3 years ago
- Scripts that clean up OCR and munge Hathi metadata.☆76Updated 7 years ago
- pydistinto - a Python implementation of different measures of distinctiveness for contrastive text analysis☆10Updated 2 years ago
- Python tools for performing various operations on ALTO XML files☆46Updated last month
- Contains materials for a work in progress - "A Humanist's Cookbook for Natural Language Processing in Python."☆40Updated 3 years ago
- Diachronic Spanish Sonnet Corpus. Canonical and minor authors in Spanish (Europe, America and Asia): 15th to 20th century☆16Updated last year
- Archive of the XML files of the Mannheim / Heidelberg CAMENA Neo-Latin project☆19Updated 6 years ago
- CollateX – Software for Collating Textual Sources☆92Updated last year
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- Extension for pie to include taggers with their models and pre/postprocessors☆10Updated 10 months ago
- Workshop materials for our DH2018 workshop on word vectors. Created by Eun Seo Jo, Javier de la Rosa, and Scott Bailey☆15Updated 6 years ago
- A software to detect text reuse with BLAST.☆14Updated 5 years ago
- Literary Language Toolkit: code, models, corpora, and web tools☆11Updated last year