abhilash1910 / ClusterTransformer
Topic clustering library built on Transformer embeddings and cosine similarity metrics. Compatible with all BERT base transformers from huggingface.
☆43 Updated 3 years ago
Alternatives and similar repositories for ClusterTransformer:
Users who are interested in ClusterTransformer are comparing it to the libraries listed below.
- ☆59 Updated 3 years ago
- Repo for EMNLP 2020 paper, "Improving Neural Topic Models using Knowledge Distillation" ☆31 Updated 4 years ago
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging ☆35 Updated 4 years ago
- State of the art Semantic Sentence Embeddings ☆98 Updated 2 years ago
- https://arxiv.org/pdf/1909.04054 ☆78 Updated 2 years ago
- Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine tuni… ☆79 Updated 2 years ago
- ☆85 Updated 3 years ago
- An Unsupervised Sentence Embedding Method by Mutual Information Maximization (EMNLP2020) ☆61 Updated 3 years ago
- Implementation, trained models and result data for the paper "Pairwise Multi-Class Document Classification for Semantic Relations between… ☆32 Updated last year
- ☆67 Updated 3 years ago
- ☆42 Updated 4 years ago
- Code and data from the paper BERT Got a Date: Introducing Transformers to Temporal Tagging ☆65 Updated 2 years ago
- PyTorch Implementation of Autoencoding Variational Inference for Topic Models (Srivastava and Sutton 2017) ☆38 Updated 5 years ago
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly. ☆46 Updated 2 years ago
- Scalable Attentive Sentence-Pair Modeling via Distilled Sentence Embedding (AAAI 2020) - PyTorch Implementation ☆31 Updated last year
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self… ☆201 Updated 2 years ago
- Data and code for our paper "Exploring and Predicting Transferability across NLP Tasks", to appear at EMNLP 2020. ☆49 Updated 3 years ago
- LongSumm - Scientific Document Summarization Task ☆74 Updated 2 years ago
- Snippext: Semi-supervised Opinion Mining with Augmented Data ☆58 Updated last year
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper ☆52 Updated last year
- Implementation of EMNLP2020 accepted paper: "TopicBERT: Topic-aware BERT for Efficient Document Classification" ☆43 Updated 4 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation ☆93 Updated 2 years ago
- Code and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking". ☆50 Updated 3 years ago
- Code for NAACL 2021 full paper "Efficient Attentions for Long Document Summarization" ☆65 Updated 3 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations". ☆63 Updated 4 years ago
- Using BERT For Classifying Documents with Long Texts, check my latest post: https://armandolivares.tech/ ☆41 Updated 5 years ago
- Multi^2OIE: Multilingual Open Information Extraction Based on Multi-Head Attention with BERT (Findings of ACL: EMNLP 2020) ☆56 Updated 2 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition ☆31 Updated 2 years ago
- Few-shot learning framework for opinion summarization published at EMNLP 2020. ☆35 Updated 3 years ago
- Kex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public data… ☆54 Updated 2 years ago