r0zetta / meta_embedding_clustering
Clustering of tweets based on textual content using meta embeddings and community detection
☆13Updated 5 years ago
Alternatives and similar repositories for meta_embedding_clustering
Users who are interested in meta_embedding_clustering are comparing it to the repositories listed below.
- SemEval 2019 Hyperpartisan News Detection - team Bertha von Suttner contribution☆23Updated 6 years ago
- ☆54Updated 4 years ago
- WordMoversEmbeddings(WME) is a simple code for generating the vector representation of sentence/document for text classification and clus…☆83Updated 7 years ago
- Training Temporal Word Embeddings with a Compass☆65Updated 4 months ago
- ☆16Updated 7 years ago
- ☆22Updated 2 years ago
- Model for learning document embeddings along with their uncertainties☆36Updated 2 years ago
- A Large Automatically-Constructed Resource of Predicate Paraphrases☆45Updated 5 years ago
- Repo for EMNLP 2020 paper, "Improving Neural Topic Models using Knowledge Distillation"☆31Updated 5 years ago
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.☆83Updated last year
- Train, evaluate, and use different unsupervised topic modelling algorithms using a RESTful API.☆38Updated 2 years ago
- Computation of the semantic interpretability of topics produced by topic models.☆179Updated 8 years ago
- Short Text Topic Modeling☆65Updated 7 years ago
- Code and data for the WSDM '19 paper "Crosslingual Document Embedding as Reduced-Rank Ridge Regression (Cr5)"☆30Updated 6 years ago
- MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations.☆15Updated 6 years ago
- Template for AC297r projects☆33Updated 5 years ago
- This is an implementation of Hearst patterns, for finding hyponyms, written in Python.☆87Updated 3 years ago
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems☆63Updated last year
- Regex-like pattern tree matching, but on a sentence's tree instead of on strings☆42Updated 7 years ago
- Software for the paper "Gender and Lexical Variation in Social Media" with David Bamman and Tyler Schnoebelen☆17Updated 10 years ago
- Dataset and code of our EMNLP 2019 paper "Multilingual and Multi-Aspect Hate Speech Analysis"☆57Updated last year
- Kex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public data…☆54Updated 3 years ago
- This repository contains the code for the Form-Context Model and its Attentive Mimicking variant.☆31Updated 5 years ago
- Learned string similarity for entity names using optimal transport.☆35Updated 5 years ago
- Linguistic converter / merging tool for multi-level annotated corpora. Graph-based (using Python and NetworkX).☆51Updated 2 months ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆81Updated last year
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging☆35Updated 5 years ago
- Palmetto is a quality measuring tool for topics☆221Updated last year
- Dict2vec is a framework to learn word embeddings using lexical dictionaries.☆115Updated 5 years ago
- Code to compute topic coherence for several topic cardinalities and aggregate scores across them☆22Updated 4 months ago