aolieman / wayward
Wayward is a Python package that helps to identify characteristic terms from single documents or groups of documents. It can be used for keyword extraction and several related tasks, and can create efficient sparse representations for classifiers. It was originally created to provide term weights for word clouds.
☆9Updated 5 years ago
Alternatives and similar repositories for wayward:
Users that are interested in wayward are comparing it to the libraries listed below
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Updated 2 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- ADEL is a robust and efficient entity linking framework that is adaptive to text genres and language, entity types for the classification…☆17Updated 5 years ago
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆12Updated 5 months ago
- A collection of notebooks for Natural Language Processing☆25Updated last week
- Train, evaluate, and use different unsupervised topic modelling algorithms using a RESTful API.☆36Updated last year
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Updated last year
- NERtwork is a collection of scripts to help you create a network graph of co-occurring named entities using open source tools. This is do…☆49Updated 9 months ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated 10 months ago
- Python tools for text☆15Updated 4 years ago
- Trying to generate name synonyms from wikidata☆33Updated 4 years ago
- MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations.☆15Updated 5 years ago
- Wrapper for DKPro Core to extract lingustic information from books.☆16Updated 2 years ago
- Finds linguistic patterns effortlessly☆34Updated last year
- ☆30Updated 2 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Wikidata embedding☆51Updated 2 months ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 5 months ago
- ☆17Updated 9 years ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist☆38Updated 5 years ago
- Implementation of a simple frame identification approach (SimpleFrameId) described in the paper "Out-of-domain FrameNet Semantic Role Lab…☆15Updated 7 years ago
- Tool for sentiment analysis annotation☆11Updated 3 months ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆86Updated 2 years ago
- BERT and ELECTRA models trained on Europeana Newspapers☆37Updated 3 years ago
- Citation Classification using hybrid neural network model for Wikipedia References☆28Updated 2 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆50Updated 4 years ago
- TeXoo – A Zoo of Text Extractors☆18Updated 4 years ago
- Rich Context leaderboard competition, including the corpus and current SOTA for required tasks.☆21Updated 4 years ago