[WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations
☆91Feb 10, 2022Updated 4 years ago
Alternatives and similar repositories for TopClus
Users that are interested in TopClus are comparing it to the libraries listed below
Sorting:
- Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds (NAACL'22)☆18Feb 18, 2025Updated last year
- [KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding☆58Feb 14, 2021Updated 5 years ago
- The source code used for paper "Effective Seed-Guided Topic Discovery by Integrating Multiple Types of Contexts", in WSDM 2023.☆15May 27, 2023Updated 2 years ago
- This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (K…☆174Feb 3, 2023Updated 3 years ago
- [WWW 2020] Discriminative Topic Mining via Category-Name Guided Text Embedding☆51Jan 6, 2021Updated 5 years ago
- [EMNLP 2021] Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training☆65Nov 12, 2021Updated 4 years ago
- Metadata-Induced Contrastive Learning for Zero-Shot Multi-Label Text Classification (WWW'22)☆32Jun 21, 2025Updated 8 months ago
- Hierarchical Metadata-Aware Document Categorization under Weak Supervision (WSDM'21)☆45Apr 2, 2024Updated last year
- Topic taxonomy completion with hierarchical discovery of novel topic clusters☆24Mar 7, 2022Updated 3 years ago
- code and data for paper "GIANT: Scalable Creation of a Web-scale Ontology"☆39Apr 22, 2020Updated 5 years ago
- The source code used for paper "Unsupervised Key Event Detection from Massive Text Corpora", published in KDD 2022.☆22Jul 15, 2023Updated 2 years ago
- Code for paper OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision☆31May 9, 2022Updated 3 years ago
- An updated version of eICU Benchmark with an updated problem definition on LoS and Decompensation tasks☆11Aug 12, 2021Updated 4 years ago
- PyTorch Implementation of Autoencoding Variational Inference for Topic Models (Srivastava and Sutton 2017)☆39Oct 1, 2019Updated 6 years ago
- ☆13Sep 2, 2021Updated 4 years ago
- ☆25Oct 27, 2020Updated 5 years ago
- ☆88Dec 5, 2021Updated 4 years ago
- Generating Sentences from Disentangled Syntactic and Semantic Spaces☆11Jun 24, 2019Updated 6 years ago
- Code for ACL 2021 paper: Accelerating BERT Inference for Sequence Labeling via Early-Exit☆28Aug 19, 2022Updated 3 years ago
- Code for the paper "NetTaxo: Automated Topic Taxonomy Constructionfrom Text-Rich Network"☆32Feb 23, 2022Updated 4 years ago
- Dataset, models, and code for paper "CiteSum: Citation Text-guided Scientific Extreme Summarization and Low-resource Domain Adaptation", …☆33Jul 8, 2022Updated 3 years ago
- ☆26Nov 26, 2021Updated 4 years ago
- [EMNLP 2020] Text Classification Using Label Names Only: A Language Model Self-Training Approach☆299Feb 2, 2022Updated 4 years ago
- ArSarcasm-v2 is an extension to the original ArSarcasm dataset. It was used for the shared task on sarcasm detection and sentiment analys…☆12Jan 26, 2022Updated 4 years ago
- ☆16Jun 19, 2023Updated 2 years ago
- ☆14May 15, 2020Updated 5 years ago
- [AAAI 2019] Weakly-Supervised Hierarchical Text Classification☆86Dec 11, 2021Updated 4 years ago
- The source code used for paper "Empower Entity Set Expansion via Language Model Probing", published in ACL 2020.☆33Nov 3, 2020Updated 5 years ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher…☆1,265Jul 24, 2025Updated 7 months ago
- A Deep RL Wordle Bot☆12Dec 6, 2022Updated 3 years ago
- This code belongs to ACL conference paper entitled as "An Online Semantic-enhanced Dirichlet Model for Short Text Stream Clustering"☆17Apr 22, 2021Updated 4 years ago
- Optimization Models and Algorithms☆17Updated this week
- Explainable Zero-Shot Topic Extraction☆65Aug 19, 2024Updated last year
- Implementation of topic models based on neural network approaches.☆435Sep 27, 2023Updated 2 years ago
- ☆32Jul 10, 2023Updated 2 years ago
- ☆18Jul 11, 2021Updated 4 years ago
- This is a Pytorch implementation of "Deep Low-Rank Subspace Clustering" (CVPRW 2020).☆14Sep 18, 2020Updated 5 years ago
- Differential Robust PCA☆13Jul 10, 2022Updated 3 years ago
- The NewSHead dataset is a multi-doc headline dataset used in NHNet for training a headline summarization model.☆37Jan 7, 2022Updated 4 years ago