rwalk / gsdmm-rust
GSDMM: Short text clustering (Rust implementation)
☆23 · Updated last year
Alternatives and similar repositories for gsdmm-rust:
Users who are interested in gsdmm-rust are comparing it to the libraries listed below:
- Learned string similarity for entity names using optimal transport. ☆35 · Updated 4 years ago
- ☆54 · Updated 3 years ago
- My most frequently used learning-to-rank algorithms ported to rust for efficiency. Try it: "pip install fastrank". ☆52 · Updated this week
- SemEval 2019 Hyperpartisan News Detection - team Bertha von Suttner contribution ☆22 · Updated 5 years ago
- Supporting example for "A Rust SentencePiece implementation" ☆18 · Updated 4 years ago
- A tool for learning significant phrase/term models, and efficiently labeling with them. ☆32 · Updated last year
- WordMoversEmbeddings(WME) is a simple code for generating the vector representation of sentence/document for text classification and clus… ☆81 · Updated 6 years ago
- PyTorch Implementation of Autoencoding Variational Inference for Topic Models (Srivastava and Sutton 2017) ☆38 · Updated 5 years ago
- Short Text Topic Modeling ☆65 · Updated 6 years ago
- Dynamic Topic Modeling and Topic Chains of Reuters News Articles using SCVB0 ☆23 · Updated 8 years ago
- Neural topic modeling ☆29 · Updated 4 years ago
- Implementation of Deep Dirichlet Multinomial Regression in python + cython. ☆16 · Updated 6 years ago
- Repo for MCMC based Dynamic Topic Model ☆16 · Updated 7 years ago
- Making word sense embeddings interpretable. A tool for matching word sense embeddings with synsets of lexical resources. ☆11 · Updated 8 years ago
- Short Text Similarity as described in https://dl.acm.org/citation.cfm?id=2806475 ☆16 · Updated 6 years ago
- Interpretable feature construction from taxonomies for text classification ☆18 · Updated 2 years ago
- Tools relating to the CC-News-En Collection ☆19 · Updated last year
- Converter from UD-trees to BART representation ☆36 · Updated 11 months ago
- A curated list of resources related to temporal embeddings ☆14 · Updated 6 years ago
- Model for learning document embeddings along with their uncertainties ☆35 · Updated last year
- PyTorch implementation of context2vec from Melamud et al., CoNLL 2016 ☆19 · Updated 6 years ago
- Highly specialized crate to parse and use `google/sentencepiece`'s precompiled_charsmap in `tokenizers` ☆18 · Updated 2 years ago
- Sentiment polarity annotations dataset ☆26 · Updated 7 years ago
- 🦀 A Rust implementation of a RoBERTa classification model for the SNLI dataset ☆13 · Updated 3 years ago
- code for our EMNLP2020 paper: Multilevel Text Alignment with Cross-Document Attention by Xuhui Zhou, Nikolaos Pappas, and Noah A. Smith ☆13 · Updated 3 years ago
- Implementation of GloVe in Keras ☆45 · Updated 2 years ago
- Code and data for ACL2016 article "Which argument is more convincing? Analyzing and predicting convincingness of Web arguments using bidi… ☆28 · Updated 8 years ago
- AllenNLP model for the Kaggle toxic comments challenge ☆32 · Updated 6 years ago
- A set of tools for leveraging pre-trained embeddings, active learning and model explainability for efficient document classification ☆29 · Updated 3 weeks ago
- Kex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public data… ☆54 · Updated 3 years ago