aolieman / wayward
Wayward is a Python package that helps to identify characteristic terms from single documents or groups of documents. It can be used for keyword extraction and several related tasks, and can create efficient sparse representations for classifiers. It was originally created to provide term weights for word clouds.
☆9Updated 5 years ago
Alternatives and similar repositories for wayward:
Users that are interested in wayward are comparing it to the libraries listed below
- Entity linker for the newspaper collection of the National Library of the Netherlands. Links named entity mentions to DBpedia description…☆11Updated 2 years ago
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Tutorial on NE processing for Digital Humanities - DH Utrech 2019☆25Updated 5 years ago
- A collection of notebooks for Natural Language Processing☆25Updated 3 months ago
- ADEL is a robust and efficient entity linking framework that is adaptive to text genres and language, entity types for the classification…☆19Updated 5 years ago
- spaCy-to-naf converter☆21Updated 10 months ago
- BERT and ELECTRA models trained on Europeana Newspapers☆38Updated 3 years ago
- Wikidata embedding☆50Updated 5 months ago
- Train, evaluate, and use different unsupervised topic modelling algorithms using a RESTful API.☆36Updated last year
- GC4LM: A Colossal (Biased) language model for German☆13Updated 3 years ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec☆19Updated 2 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated 2 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Updated last year
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆17Updated 8 months ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 4 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆26Updated 3 years ago
- Wrapper for DKPro Core to extract lingustic information from books.☆16Updated 3 years ago
- ☆16Updated 10 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Updated last year
- Finds linguistic patterns effortlessly☆36Updated last year
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing.☆19Updated 2 years ago
- Humanities Entity Recognition: robust, practical, efficient Named Entity Recognition for today's digital humanist☆36Updated 6 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling☆23Updated 4 years ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…