YaleDHLab / intertext
Detect and visualize text reuse
☆118Updated 6 months ago
Alternatives and similar repositories for intertext:
Users that are interested in intertext are comparing it to the libraries listed below
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆94Updated 2 years ago
- Combination of the RapidFuzz library with Spacy PhraseMatcher☆11Updated 3 years ago
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆158Updated 2 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆112Updated 2 months ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- High-performance text aligner for large collections of texts☆50Updated 5 months ago
- German sentiment scores with SentiWS as extension for spaCy☆37Updated 2 years ago
- Trying to generate name synonyms from wikidata☆32Updated 4 years ago
- Scripts that clean up OCR and munge Hathi metadata.☆76Updated 7 years ago
- Visualize large text collections with WebGL☆25Updated 6 months ago
- SEM, a free NLP tool relying on machine learning technologies, especially CRFs.☆24Updated 3 years ago
- Detect and align similar passages☆98Updated 2 months ago
- ☆19Updated 6 months ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆53Updated last year
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year
- A lemmatizer for German language text☆88Updated 2 years ago
- Literary Language Toolkit: code, models, corpora, and web tools☆11Updated last year
- A browser user interface for manual labeling of record pairs.☆46Updated last year
- Python port for IWNLP.Lemmatizer☆17Updated last year
- A textual corpus database for the digital humanities.☆61Updated 4 years ago
- Python package for harvesting records from OAI-PMH provider(s).☆62Updated 2 years ago
- Language detection using Spacy and Fasttext☆55Updated last year
- Poetic processing, for Python.☆40Updated 11 months ago
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)☆22Updated 11 months ago
- ☆11Updated 3 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆38Updated 3 years ago
- Annotation Management for Prodigy, that support multiple users working in many projects☆15Updated 6 years ago
- Citation Classification using hybrid neural network model for Wikipedia References☆28Updated 2 years ago