IBM / comparing-corpora
A python library of similarity measures which allow measuring the perceptual similarity between set embeddings corpora.
☆12Updated last year
Related projects ⓘ
Alternatives and complementary repositories for comparing-corpora
- Authorship Verification in Social Media via Attention-based Similarity Learning☆20Updated 3 years ago
- ☆9Updated last year
- This repository hosts the dataset for the paper Computer Science Named Entity Recognition in the Open Research Knowledge Graph☆18Updated 10 months ago
- BirdSpotter is a python package which provides an influence and bot detection toolkit for twitter.☆19Updated 3 years ago
- A small repository to test Captum Explainable AI with a trained Flair transformers-based text classifier.☆26Updated 3 years ago
- Alternative implementation of the coreference scorer for the CoNLL-2011/2012 shared tasks on coreference resolution☆11Updated 3 years ago
- Repository for the Tweet2Story framework for the extraction of narratives from tweets.☆13Updated 2 years ago
- Sentence specificity prediction☆25Updated 5 years ago
- GisPy: A Tool for Measuring Gist Inference Score in Text https://aclanthology.org/2022.wnu-1.5/☆11Updated 4 months ago
- SentimentArcs: a large ensemble of dozens of sentiment analysis models to analyze emotion in text over time☆35Updated last year
- Train, evaluate, and use different unsupervised topic modelling algorithms using a RESTful API.☆36Updated last year
- Training Temporal Word Embeddings with a Compass☆63Updated last year
- Analysis and experiments on the UN General Debate corpus☆37Updated 5 years ago
- An implementation of GrASP (Shnarch et. al., 2017)☆21Updated 2 years ago
- Tool for parsing and converting various span encoding schemes.☆22Updated 10 months ago
- ☆53Updated 10 months ago
- This repository implements the interaction with DBLP, information extraction and pre-processing of papers, and a client to store data to …☆10Updated last year
- Learned string similarity for entity names using optimal transport.☆34Updated 3 years ago
- Repository for our ACL 2020 paper "Learning and Evaluating Emotion Lexicons for 91 Languages"☆24Updated last year
- Additional material for the paper "MoralStrength: Exploiting a Moral Lexicon and Embedding Similarity for Moral Foundations Prediction"☆53Updated last year
- Tool for the Automatic Assessment of Lexical Diversity☆11Updated 3 years ago
- ☆8Updated 2 years ago
- ☆50Updated 8 months ago
- OptimSeed - Seed Word Selection for Weakly-Supervised Text Classification [NAACL SRW 2021]☆14Updated 3 years ago
- A Python wrapper around the topic modeling functions of MALLET.☆99Updated last week
- Measuring the Evolution of a Scientific Field through Citation Frames☆53Updated 6 years ago
- Fast, flexible extraction of moral information from textual input data.☆103Updated last year
- Using Huggingface to generate relation expressions☆15Updated 3 years ago