jfilter / hyperhyper
🧮 Python package to construct word embeddings for small data using PMI and SVD
☆17 Updated 4 years ago
Alternatives and similar repositories for hyperhyper:
Users who are interested in hyperhyper are comparing it to the libraries listed below.
- Coreference resolution for German ☆16 Updated 7 years ago
- A Python library for topic modeling and visualization ☆65 Updated 4 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata ☆94 Updated 2 years ago
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure" ☆28 Updated 4 years ago
- Explore your own text collection with a topic model – without prior knowledge. ☆62 Updated 3 months ago
- Python package for stylometry ☆63 Updated 4 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions ☆19 Updated 2 years ago
- Analyze Argumentation and Rhetorical Aspects in Scientific Writing. ☆19 Updated 2 years ago
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF ☆38 Updated 3 years ago
- German lemmatization with IWNLP as extension for spaCy ☆24 Updated last year
- Literary Language Toolkit: code, models, corpora, and web tools ☆11 Updated last year
- Linguistic and stylistic complexity measures for (literary) texts ☆80 Updated last year
- Quick implementation of Monroe et al.'s algorithm for comparing languages ☆52 Updated 4 years ago
- Running Prodigy for a team of annotators ☆53 Updated 4 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project ☆26 Updated 3 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System ☆41 Updated last year
- Training Temporal Word Embeddings with a Compass ☆64 Updated 2 years ago
- ☆70 Updated 2 years ago
- A Large Automatically-Constructed Resource of Predicate Paraphrases ☆45 Updated 5 years ago
- The Potsdam Twitter Sentiment Corpus ☆17 Updated 5 years ago
- Data and code for analyzing language associated with fictional characters. ☆15 Updated 7 years ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method. ☆11 Updated 6 years ago
- A part-of-speech tagger with support for domain adaptation and external resources. ☆22 Updated 2 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… ☆80 Updated 9 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata ☆161 Updated 2 years ago
- Notebooks configured to be run with Binder, usually found on my blog. ☆42 Updated 2 years ago
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX). ☆51 Updated 2 years ago
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems ☆58 Updated last year
- Unsupervised method for extracting quotation-speaker pairs from large news corpora. ☆29 Updated 6 years ago
- Harassment Lexicon and Corpus ☆30 Updated 6 years ago