JonathanReeve / text-matcher
A simple text reuse detection CLI tool.
☆129 Updated 7 months ago
Alternatives and similar repositories for text-matcher:
Users who are interested in text-matcher are comparing it to the libraries listed below.
- A Python library for topic modeling and visualization ☆65 Updated 4 years ago
- Visual Text Analytics for Digital Humanities ☆17 Updated 9 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface ☆253 Updated 4 months ago
- A fully-fledged PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages. ☆23 Updated last year
- Python tools for performing various operations on ALTO XML files ☆40 Updated last year
- A textual corpus database for the digital humanities. ☆60 Updated 4 years ago
- Annotation tool for coreference ☆32 Updated last year
- An R package for analysis of dramatic texts ☆15 Updated 2 years ago
- High-performance text aligner for large collections of texts ☆47 Updated 3 months ago
- A software to detect text reuse with BLAST. ☆14 Updated 5 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆93 Updated last year
- A command-line program to download text corpora. ☆34 Updated 7 years ago
- A collection of Jupyter notebooks in many human and computer languages for doing digital humanities. PRs welcome! ☆125 Updated last year
- Digital Humanities Across Borders ☆47 Updated 10 months ago
- Python 3 library for processing historical English ☆64 Updated 5 months ago
- Contains materials for a work in progress - "A Humanist's Cookbook for Natural Language Processing in Python." ☆39 Updated 3 years ago
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp… ☆29 Updated 3 years ago
- A Pythonic API and some command line tools to access the Transkribus server via its REST API ☆27 Updated 2 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrecht 2019 ☆25 Updated 5 years ago
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities. ☆124 Updated 3 years ago
- Python package for harvesting records from OAI-PMH provider(s). ☆62 Updated 2 years ago
- Corpus of Spanish Golden-Age Sonnets (with metrical annotation) / Corpus de Sonetos del Siglo de Oro (con anotación métrica) ☆35 Updated 2 years ago
- pydistinto - a Python implementation of different measures of distinctiveness for contrastive text analysis ☆10 Updated last year
- ☆14 Updated 2 years ago
- 🏖TagEditor - Annotation tool for spaCy ☆190 Updated 2 years ago
- Umbrella repository that describes the collections contained in any given release of ELTeC ☆13 Updated 3 years ago
- ☆28 Updated 3 years ago
- A Python wrapper around the topic modeling functions of MALLET. ☆102 Updated 2 months ago
- Detect and align similar passages ☆95 Updated 2 months ago
- You Actually Look Twice At it ☆29 Updated last week