IBM / comparing-corporaLinks
A python library of similarity measures which allow measuring the perceptual similarity between set embeddings corpora.
☆14Updated last month
Alternatives and similar repositories for comparing-corpora
Users that are interested in comparing-corpora are comparing it to the libraries listed below
Sorting:
- A Python wrapper around the topic modeling functions of MALLET.☆102Updated 7 months ago
- An easy and robust model for Lexical Semantic Change Detection☆14Updated last year
- A simple toolkit for conducting analyses using corpus methods☆25Updated 3 years ago
- Training Temporal Word Embeddings with a Compass☆64Updated 2 years ago
- Dutch coreference resolution & dialogue analysis using deterministic rules☆21Updated 2 years ago
- An Easy Annotation Tool for Natural Language Processing☆10Updated last year
- ☆53Updated last year
- A module to compute textual lexical richness (aka lexical diversity).☆108Updated last year
- Contextualised Word Representations for Lexical Semantic Change Analysis☆31Updated 4 years ago
- A collection of text simplification datasets and other resources☆44Updated 8 months ago
- Analysis and experiments on the UN General Debate corpus☆36Updated 6 years ago
- German sentiment scores with SentiWS as extension for spaCy☆37Updated 2 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆81Updated last year
- A set of tools for topical text classification and scaling☆20Updated 2 years ago
- Repository for the CommonLit Ease of Readability Corpus☆24Updated last year
- Example code producing novelty, transience, and resonance for a sample of legislative speech during the French Revolution.☆16Updated 3 years ago
- A multilingual lexicon of words to hurt.☆89Updated 7 months ago
- Study of semantic evolution of words over time☆20Updated 2 years ago
- A python package for the Linguistic Inquiry and Word Count (LIWC) dictionary.☆40Updated 4 years ago
- Geolocation Inference for Reddit☆12Updated 11 months ago
- Scripts for large-scale prediction of lexical semantic change.☆12Updated 2 years ago
- Code for the paper "Simple, Interpretable and Stable Method for Detecting Words with Usage Change across Corpora", ACL 2020.☆18Updated 4 years ago
- A Python library for calculating a large variety of metrics from text☆339Updated 5 months ago
- This is a simple Python package for calculating a variety of lexical diversity indices☆77Updated last year
- PYthon Automated Term Extraction☆313Updated 2 years ago
- Compass-aligned Distributional Embeddings. Align embeddings from different corpora☆39Updated 2 years ago
- Train, evaluate, and use different unsupervised topic modelling algorithms using a RESTful API.☆37Updated last year
- ☆12Updated 4 years ago
- Python version for Doug Biber's Multidimensional Analysis (MDA)☆31Updated 3 months ago
- Package to extract connotation frames☆85Updated last year