JonathanReeve / text-matcherLinks
A simple text reuse detection CLI tool.
☆135Updated last year
Alternatives and similar repositories for text-matcher
Users that are interested in text-matcher are comparing it to the libraries listed below
Sorting:
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- Detect and align similar passages☆104Updated 2 months ago
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆127Updated 4 years ago
- Explore your own text collection with a topic model – without prior knowledge.☆63Updated 6 months ago
- High-performance text aligner for large collections of texts☆52Updated 2 months ago
- Contains materials for a work in progress - "A Humanist's Cookbook for Natural Language Processing in Python."☆40Updated 3 years ago
- Detect and visualize text reuse☆118Updated 10 months ago
- R package for stylometric analyses☆193Updated 6 months ago
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp…☆30Updated 3 weeks ago
- Digital Humanities Across Borders☆48Updated last year
- A textual corpus database for the digital humanities.☆62Updated 4 years ago
- Python package for stylometry☆63Updated 4 years ago
- A command-line program to download text corpora.☆34Updated 7 years ago
- Scripts that clean up OCR and munge Hathi metadata.☆76Updated 7 years ago
- This is code that we will cover in my Hacking the Humanities class at Leiden University. Video tutorials will be uploaded to my YouTube c…☆32Updated 6 years ago
- A Python wrapper around the topic modeling functions of MALLET.☆103Updated 8 months ago
- Linguistic and stylistic complexity measures for (literary) texts☆82Updated last year
- A lemmatizer for German language text☆91Updated 2 years ago
- Project on the history of genre.☆23Updated 5 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models☆51Updated 8 years ago
- Data Mining Historical Newspaper Metadata (METS/ALTO formats)☆25Updated 2 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 6 years ago
- Distant Viewing Toolkit for the Analysis of Visual Culture☆98Updated 3 months ago
- ☆32Updated 2 years ago
- The Art of Literary Text Analysis☆166Updated 6 years ago
- ☆13Updated 2 years ago
- Annif is a multi-algorithm automated subject indexing tool for libraries, archives and museums.☆232Updated last week
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year
- A Flexible Deep Learning Approach to Fuzzy String Matching☆145Updated 9 months ago