JonathanReeve / text-matcher
A simple text reuse detection CLI tool.
☆133 Updated 11 months ago
Alternatives and similar repositories for text-matcher
Users that are interested in text-matcher are comparing it to the libraries listed below
- A Python library for topic modeling and visualization ☆65 Updated 4 years ago
- A textual corpus database for the digital humanities. ☆62 Updated 4 years ago
- Detect and align similar passages ☆102 Updated 3 weeks ago
- High-performance text aligner for large collections of texts ☆51 Updated last month
- Morphological analyzer and lemmatizer for Latin. ☆27 Updated 4 months ago
- Python library for automatic analysis of Ancient Greek hexameter. The algorithm uses linguistic rules and finite-state technology. ☆20 Updated last year
- Explore your own text collection with a topic model – without prior knowledge. ☆63 Updated 5 months ago
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities. ☆127 Updated 3 years ago
- Project on the history of genre. ☆23 Updated 5 years ago
- A software to detect text reuse with BLAST. ☆14 Updated 5 years ago
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp… ☆30 Updated 3 years ago
- A command-line program to download text corpora. ☆34 Updated 7 years ago
- Extension for pie to include taggers with their models and pre/postprocessors ☆10 Updated last year
- Text collections made available by the CLiGS group. ☆23 Updated 3 years ago
- Python tools for performing various operations on ALTO XML files ☆47 Updated 3 months ago
- Visual Text Analytics for Digital Humanities ☆17 Updated 10 years ago
- Python package for stylometry ☆63 Updated 4 years ago
- A tool to extract canonical references from text. ☆20 Updated 3 years ago
- A fully-fledged PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages. ☆23 Updated last year
- Workshop materials for our DH2018 workshop on word vectors. Created by Eun Seo Jo, Javier de la Rosa, and Scott Bailey ☆15 Updated 6 years ago
- Contains materials for a work in progress - "A Humanist's Cookbook for Natural Language Processing in Python." ☆40 Updated 3 years ago
- linguistics backend ☆41 Updated 2 years ago
- An R package for analysis of dramatic texts ☆15 Updated 2 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System ☆42 Updated last year
- Supervised Stylometry ☆23 Updated this week
- Diachronic Spanish Sonnet Corpus. Canonical and minor authors in Spanish (Europe, America and Asia): 15th to 20th century ☆16 Updated last year
- Tutorials for the CLTK ☆53 Updated 4 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models ☆51 Updated 7 years ago
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpus ☆18 Updated last year
- In-browser OCR of Ancient Greek and Latin ☆26 Updated last month