aolieman / waywardLinks

Wayward is a Python package that helps to identify characteristic terms from single documents or groups of documents. It can be used for keyword extraction and several related tasks, and can create efficient sparse representations for classifiers. It was originally created to provide term weights for word clouds.

☆9

Alternatives and similar repositories for wayward

Users that are interested in wayward are comparing it to the libraries listed below

Sorting:

KBNLresearch / dac
Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…
☆11Updated 2 years ago
wjbmattingly / bagpipes-spacy
Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.
☆18Updated 11 months ago
kermitt2 / grobid-ner
A Named-Entity Recogniser based on Grobid.
☆54Updated 2 months ago
UB-Mannheim / bbw
Entity linking, entity typing and relation extraction: Matching CSV to a Wikibase instance (e.g., Wikidata) via Meta-lookup
☆70Updated 2 months ago
BramVanroy / spacy-extreme
An example of how to use spaCy for extremely large files without running into memory issues
☆36Updated 2 years ago
fnielsen / wembedder
Wikidata embedding
☆50Updated 9 months ago
tudarmstadt-lt / GermaNER
GermaNER: Free Open German Named Entity Recognition Tool
☆36Updated last year
jplu / ADEL
ADEL is a robust and efficient entity linking framework that is adaptive to text genres and language, entity types for the classification…
☆19Updated 5 years ago
brandontlocke / NERtwork
NERtwork is a collection of scripts to help you create a network graph of co-occurring named entities using open source tools. This is do…
☆48Updated last year
impresso / named-entity-tutorial-dh2019
Tutorial on NE processing for Digital Humanities - DH Utrech 2019
☆25Updated 6 years ago
alexerdmann / HER
Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist
☆37Updated 6 years ago
Commonists / pageview-api
Wikimedia Pageview API client
☆28Updated 7 years ago
gkiril / MinSCIE
MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations.
☆15Updated 6 years ago
nikolamilosevic86 / TableDisentangler
Functional and structural analysis of tables in research papers (Table disentangling)
☆20Updated 8 years ago
microsoft / spacy-ann-linker
spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking
☆85Updated 2 years ago
UB-Mannheim / spacyopentapioca
A spaCy wrapper of OpenTapioca for named entity linking on Wikidata
☆94Updated 2 years ago
NewsEye / NLP-Notebooks-Newspaper-Collections
A collection of notebooks for Natural Language Processing
☆25Updated 6 months ago
dkpro / dkpro-c4corpus
DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…
☆52Updated 5 years ago
Liebeck / spacy-iwnlp
German lemmatization with IWNLP as extension for spaCy
☆24Updated 2 years ago
alephdata / synonames
Trying to generate name synonyms from wikidata
☆32Updated 5 years ago
jkkummerfeld / slate
A Super-Lightweight Annotation Tool for Experts: Label text in a terminal with just Python
☆111Updated 2 months ago
cvbrandoe / REDEN
Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts
☆27Updated 3 years ago
krzysiekfonal / grammaregex
Regex like pattern tree matching but on sentence's tree instead of Strings
☆42Updated 7 years ago
chozelinek / europarl
Toolkit to compile a comparable/parallel corpus from European Parliament proceedings
☆16Updated 5 years ago
dbmdz / historic-ner
Repository for "Towards Robust Named Entity Recognition for Historic German"
☆18Updated 4 years ago
revuel / PatternOmatic
Finds linguistic patterns effortlessly
☆37Updated last year
nlppln / nlppln
NLP pipeline software using common workflow language
☆34Updated 6 years ago
Harshdeep1996 / cite-classifications-wiki
Citation Classification using hybrid neural network model for Wikipedia References
☆30Updated 2 years ago
iptc / extra
Homebase of the IPTC EXTRA project about rule-based text categorization
☆13Updated 8 years ago
Living-with-machines / DeezyMatch
A Flexible Deep Learning Approach to Fuzzy String Matching
☆146Updated 9 months ago