jfilter / hyperhyper
🧮 Python package to construct word embeddings for small data using PMI and SVD
☆16Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for hyperhyper
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆38Updated 2 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated last year
- Visual analytics application for qualitative text analysis☆24Updated last year
- extract relationships from standardized terms from corpus of interest with deep learning☆20Updated 4 years ago
- Text readability metrics in Python.☆12Updated 11 years ago
- A package to easily train Bert-like models for text classification☆14Updated last year
- German lemmatization with IWNLP as extension for spaCy☆24Updated last year
- Repository for code and metadata to support work described in "Authorless Topic Models: Biasing Models Away from Known Structure"☆28Updated 4 years ago
- RESTful API around the PETRARCH coding software☆10Updated 3 years ago
- 📚 Online archive for annual reports of the German internal intelligence☆11Updated last week
- A part-of-speech tagger with support for domain adaptation and external resources.☆22Updated 2 years ago
- Citation Classification using hybrid neural network model for Wikipedia References☆28Updated last year
- Github mirror - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access for contributing)☆47Updated 5 months ago
- Python based Wikidata framework for easy dataframe extraction☆39Updated 11 months ago
- An index data structure for approximate string search.☆23Updated 5 years ago
- Lexicons for the Multilingual UCREL Semantic Analysis System☆39Updated last year
- ☆22Updated 3 years ago
- ☆70Updated last year
- Extract networks of entities from journalistic reporting☆47Updated last year
- Scrapes the web. Gets the news.☆13Updated 8 years ago
- Fast, flexible extraction of moral information from textual input data.☆103Updated last year
- Anonymization of legal cases (Fr) based on Flair embeddings☆87Updated 3 years ago
- Information extraction and interactive visualization of textual datasets for investigative data-driven journalism and eDiscovery☆53Updated 4 months ago
- Unsupervised method for extracting quotation-speaker pairs from large news corpora.☆28Updated 6 years ago
- Code for learning geographically-informed word embeddings☆22Updated 2 years ago
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆12Updated 5 years ago
- Turning news into events since 2014.☆50Updated 7 years ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆91Updated last year
- ☆24Updated last year