abhilash1910 / ClusterTransformer
Topic clustering library built on Transformer embeddings and cosine similarity metrics. Compatible with all BERT base transformers from Hugging Face.
☆43Updated 4 years ago
Alternatives and similar repositories for ClusterTransformer
Users who are interested in ClusterTransformer are comparing it to the libraries listed below.
- ☆59Updated 4 years ago
- https://arxiv.org/pdf/1909.04054☆79Updated 3 years ago
- Self-supervised NER prototype - updated version (69 entity types - 17 broad entity groups). Uses pretrained BERT models with no fine tuni…☆78Updated 3 years ago
- Implementation, trained models and result data for the paper "Aspect-based Document Similarity for Research Papers" #COLING2020☆63Updated last year
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆205Updated 3 years ago
- ☆88Updated 3 years ago
- A library to conduct ranking experiments with transformers.☆160Updated 2 years ago
- State of the art Semantic Sentence Embeddings☆99Updated 3 years ago
- Source code for our AAAI 2020 paper P-SIF: Document Embeddings using Partition Averaging☆35Updated 5 years ago
- Implementation of DeepXML☆64Updated 3 years ago
- Code for Paper "Target-oriented Fine-tuning for Zero-Resource Named Entity Recognition"☆20Updated 3 years ago
- ☆42Updated 5 years ago
- Code and resources for the paper "BERT-QE: Contextualized Query Expansion for Document Re-ranking".☆51Updated 4 years ago
- A text augmentation tool for named entity recognition.☆54Updated 4 years ago
- Stacked Denoising BERT for Noisy Text Classification (Neural Networks 2020)☆32Updated 2 years ago
- Corresponding code repo for the paper at COLING 2020 - ARGMIN 2020: "DebateSum: A large-scale argument mining and summarization dataset"☆55Updated 3 years ago
- GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition☆31Updated 3 years ago
- Implementation of SiameseXML (ICML 2021)☆40Updated 3 years ago
- On Generating Extended Summaries of Long Documents☆78Updated 4 years ago
- Code and data from the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆68Updated 3 years ago
- [WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations☆91Updated 3 years ago
- Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper☆51Updated 2 years ago
- ☆68Updated 6 months ago
- ☆18Updated 3 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆64Updated 5 years ago
- [KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding☆58Updated 4 years ago
- Code for the paper "True Few-Shot Learning in Language Models" (https://arxiv.org/abs/2105.11447)☆144Updated 4 years ago
- DECAF: Deep Extreme Classification with Label Features☆54Updated 3 years ago
- Implementation of EMNLP2020 accepted paper: "TopicBERT: Topic-aware BERT for Efficient Document Classification"☆43Updated 5 years ago
- ☆52Updated 4 years ago