JonathanReeve / text-matcher
A simple text reuse detection CLI tool.
☆126Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for text-matcher
- A Python library for topic modeling and visualization☆64Updated 4 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- High-performance text aligner for large collections of texts☆45Updated 3 weeks ago
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆22Updated last year
- Detect and align similar passages☆88Updated 2 months ago
- Python tools for performing various operations on ALTO XML files☆39Updated last year
- Lexicons for the Multilingual UCREL Semantic Analysis System☆39Updated last year
- Python package for harvesting records from OAI-PMH provider(s).☆62Updated 2 years ago
- Explore your own text collection with a topic model – without prior knowledge.☆62Updated 3 weeks ago
- Project on the history of genre.☆22Updated 4 years ago
- ☆28Updated 3 years ago
- Topic Words in Context (TWiC) is a highly-interactive, browser-based visualization for MALLET topic models☆51Updated 7 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆91Updated last year
- Workshop materials for our DH2018 workshop on word vectors. Created by Eun Seo Jo, Javier de la Rosa, and Scott Bailey☆15Updated 6 years ago
- A textual corpus database for the digital humanities.☆59Updated 4 years ago
- An OCR evaluation tool☆64Updated last month
- A software to detect text reuse with BLAST.☆14Updated 5 years ago
- Distant Viewing Toolkit for the Analysis of Visual Culture☆91Updated 2 months ago
- Deutsches Lyrik Korpus (DLK) / German Poetry Corpus☆17Updated 6 months ago
- Visual Text Analytics for Digital Humanities☆17Updated 9 years ago
- Scripts that clean up OCR and munge Hathi metadata.☆74Updated 7 years ago
- Detect and visualize text reuse☆115Updated 2 months ago
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆124Updated 3 years ago
- Text Re-use Alignment Visualization☆37Updated 7 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- A simple collocation-driven recognition of rhymes. Contains pre-trained models for Czech, Dutch, English, French, German, Russian, and Sp…☆29Updated 3 years ago
- Digital Humanities Across Borders☆46Updated 8 months ago
- LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio…☆68Updated last year
- A script to generate tagged XML Citationstrings for citation parsing☆18Updated 4 years ago
- ☆14Updated 2 years ago