YaleDHLab / intertextLinks
Detect and visualize text reuse
☆118Updated last year
Alternatives and similar repositories for intertext
Users that are interested in intertext are comparing it to the libraries listed below
Sorting:
- A Python library for topic modeling and visualization☆65Updated 4 years ago
- Python package for harvesting records from OAI-PMH provider(s).☆64Updated 3 years ago
- German lemmatization with IWNLP as extension for spaCy☆24Updated 2 years ago
- A simple text reuse detection CLI tool.☆136Updated last year
- Python package for stylometry☆63Updated 4 years ago
- Python port for IWNLP.Lemmatizer☆17Updated last year
- Guess gender from first name in Python 2 and 3☆137Updated 3 months ago
- A command-line program to download text corpora.☆34Updated 8 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Updated 7 months ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆94Updated 2 years ago
- A lemmatizer for German language text☆92Updated 2 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated 2 years ago
- A set of workflows for corpus building through OCR, post-correction and normalisation☆49Updated 3 years ago
- A machine learning tool for fishing entities☆266Updated 3 months ago
- Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup☆70Updated 3 months ago
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆127Updated 4 years ago
- Detect and align similar passages☆108Updated 4 months ago
- Custom French POS and lemmatizer based on Lefff for spacy☆68Updated 2 years ago
- Practical Approaches to Data Science with Text☆39Updated 5 years ago
- Scripts that clean up OCR and munge Hathi metadata.☆76Updated 7 years ago
- Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist☆37Updated 6 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 6 years ago
- Visualize large text collections with WebGL☆26Updated last year
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆164Updated 2 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆182Updated 2 years ago
- A deep learning architecture for reference mining from literature in the arts and humanities.☆16Updated 6 years ago
- Poetic processing, for Python.☆42Updated last year
- Extract dates from text☆65Updated 4 years ago
- Explore your own text collection with a topic model – without prior knowledge.☆64Updated last week
- A Named-Entity Recogniser based on Grobid.☆54Updated 4 months ago