rwalk / gsdmm-rust
GSDMM: Short text clustering (Rust implementation)
☆23Updated last year
Related projects ⓘ
Alternatives and complementary repositories for gsdmm-rust
- Model for learning document embeddings along with their uncertainties☆35Updated 11 months ago
- Converter from UD-trees to BART representation☆36Updated 8 months ago
- A monolingual parallel corpus for sentence simplification☆11Updated 8 years ago
- Official details for: [1803.08493] Context is Everything: Finding Meaning Statistically in Semantic Spaces☆39Updated 5 years ago
- My most frequently used learning-to-rank algorithms ported to rust for efficiency. Try it: "pip install fastrank".☆48Updated last year
- A Test Collection of Computer Science Papers for Faceted Query by Example☆21Updated 2 years ago
- WordMoversEmbeddings(WME) is a simple code for generating the vector representation of sentence/document for text classification and clus…☆81Updated 5 years ago
- Extracting narrative timelines (i.e. order and timing of events) from text☆20Updated 5 years ago
- code for our EMNLP2020 paper: Multilevel Text Alignment with Cross-Document Attention by Xuhui Zhou, Nikolaos Pappas, and Noah A. Smith☆13Updated 3 years ago
- Zero-Shot Open Entity Typing as Type-Compatible Grounding, EMNLP'18.☆43Updated 4 years ago
- An entity linking prototype, developed using the datasets from the TAC-KBP sub-task.☆28Updated 7 years ago
- Highly specialized crate to parse and use `google/sentencepiece` 's precompiled_charsmap in `tokenizers`☆18Updated 2 years ago
- Entity Linking in Queries: Efficiency vs. Effectiveness☆18Updated 7 years ago
- Python library to work with ConceptNet offline☆10Updated last year
- A benchmark to test linguistic robustness.☆12Updated 3 years ago
- Learned string similarity for entity names using optimal transport.☆34Updated 4 years ago
- PyTorch Implementation of Autoencoding Variational Inference for Topic Models (Srivastava and Sutton 2017)☆38Updated 5 years ago
- "Learning What is Essential in Questions", CoNLL, 2017☆26Updated 6 years ago
- Lightweight method based on shortest path on word graphs and NLP to generate single sentence summaries that highly relevant and grammatic…☆19Updated 7 years ago
- Extractors whose input is a chunked sentence. Includes Relnoun, Nesty, and a scala interface for ReVerb.☆28Updated 7 years ago
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆40Updated 2 years ago
- Short Text Topic Modeling☆65Updated 6 years ago
- TREC Microblog 2011-2014 Datasets☆1Updated 5 years ago
- Code for the paper: "Cross-domain Semantic Parsing via Paraphrasing" - EMNLP 2017☆15Updated 6 years ago
- A tool for learning significant phrase/term models, and efficiently labeling with them.☆31Updated last year
- BERT models for many languages created from Wikipedia texts☆34Updated 4 years ago
- Tools for working with the TREC CAR dataset.☆36Updated 2 years ago
- Pre-training character n-gram embeddings☆23Updated last year
- An example of how to use spaCy for extremely large files without running into memory issues☆36Updated 2 years ago
- Neural topic modeling☆29Updated 4 years ago