JonathanReeve / text-matcher
A simple text reuse detection CLI tool.
☆132Updated 11 months ago
Alternatives and similar repositories for text-matcher
Users that are interested in text-matcher are comparing it to the libraries listed below
Sorting:
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- High-performance text aligner for large collections of texts☆51Updated last week
- Python library for automatic analysis of Ancient Greek hexameter. The algorithm uses linguistic rules and finite-state technology.☆20Updated last year
- Contains materials for a work in progress - "A Humanist's Cookbook for Natural Language Processing in Python."☆40Updated 3 years ago
- A software to detect text reuse with BLAST.☆14Updated 5 years ago
- Detect and align similar passages☆100Updated 3 months ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year
- Explore your own text collection with a topic model – without prior knowledge.☆62Updated 4 months ago
- Text collections made available by the CLiGS group.☆23Updated 3 years ago
- Repository for the book Among Digitized Manuscripts by L.W. Cornelis van Lit (Leiden: Brill, 2020)☆22Updated 5 years ago
- A collection of Jupyter notebooks in many human and computer languages for doing digital humanities. PRs welcome!☆129Updated last year
- Visual Text Analytics for Digital Humanities☆17Updated 10 years ago
- A bunch of modules that use/extend CLTK in order to work with Greek and Latin corpora maintained by the Perseus DL☆11Updated 5 years ago
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)☆22Updated last year
- A tool to extract canonical references from text.☆20Updated 3 years ago
- HuCit KB: a knowledge base of classical texts and citable text units.☆11Updated 3 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- Sunoikisis Digital Classics 2019-2020 syllabuses☆12Updated 3 years ago
- Project on the history of genre.☆23Updated 5 years ago
- A textual corpus database for the digital humanities.☆62Updated 4 years ago
- A general-purpose NLP pipeline for Ancient Greek☆22Updated last year
- Lexicons for the Multilingual UCREL Semantic Analysis System☆41Updated last year
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆127Updated 3 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models☆51Updated 7 years ago
- Extension for pie to include taggers with their models and pre/postprocessors☆10Updated 11 months ago
- Topic Modeling Workflow in Python☆16Updated 2 years ago
- Detect and visualize text reuse☆118Updated 8 months ago
- Morphological analyzer and lemmatizer for Latin.☆27Updated 3 months ago
- ☆21Updated 4 months ago
- Diachronic Spanish Sonnet Corpus. Canonical and minor authors in Spanish (Europe, America and Asia): 15th to 20th century☆16Updated last year