[WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations
☆91Feb 10, 2022Updated 4 years ago
Alternatives and similar repositories for TopClus
Users that are interested in TopClus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds (NAACL'22)☆18Feb 18, 2025Updated last year
- The source code used for paper "Effective Seed-Guided Topic Discovery by Integrating Multiple Types of Contexts", in WSDM 2023.☆14May 27, 2023Updated 2 years ago
- [WWW 2020] Discriminative Topic Mining via Category-Name Guided Text Embedding☆51Jan 6, 2021Updated 5 years ago
- The source code used for paper "Unsupervised Key Event Detection from Massive Text Corpora", published in KDD 2022.☆22Jul 15, 2023Updated 2 years ago
- [EMNLP 2021] Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training☆65Nov 12, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (K…☆173Feb 3, 2023Updated 3 years ago
- The source code used for paper "TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision…☆25Apr 6, 2025Updated 11 months ago
- Metadata-Induced Contrastive Learning for Zero-Shot Multi-Label Text Classification (WWW'22)☆32Jun 21, 2025Updated 9 months ago
- Code for paper OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision☆31May 9, 2022Updated 3 years ago
- ☆16Jun 19, 2023Updated 2 years ago
- Hierarchical Metadata-Aware Document Categorization under Weak Supervision (WSDM'21)☆45Apr 2, 2024Updated last year
- ☆27Nov 26, 2021Updated 4 years ago
- The source code used for paper "Empower Entity Set Expansion via Language Model Probing", published in ACL 2020.☆33Nov 3, 2020Updated 5 years ago
- Generate, Prune, Select: A Pipeline for Counterspeech Generation against Online Hate Speech (ACL-IJCNLP 2021 Findings)☆13Jun 22, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- An easy-to-use tool for phrase encoding and topic mining (unsupervised aspect extraction); Code base for ACL 2022 paper, UCTopic: Unsuper…☆47Apr 25, 2023Updated 2 years ago
- Dataset, models, and code for paper "CiteSum: Citation Text-guided Scientific Extreme Summarization and Low-resource Domain Adaptation", …☆33Jul 8, 2022Updated 3 years ago
- OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)☆799Feb 20, 2026Updated last month
- [EMNLP 2020] Text Classification Using Label Names Only: A Language Model Self-Training Approach☆298Feb 2, 2022Updated 4 years ago
- The source code for SetExpan framework, published in ECML-PKDD 2017☆32Nov 22, 2021Updated 4 years ago
- [ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation wi…☆41Jun 23, 2024Updated last year
- ☆25Oct 27, 2020Updated 5 years ago
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding☆69Sep 18, 2022Updated 3 years ago
- Topic taxonomy completion with hierarchical discovery of novel topic clusters☆24Mar 7, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [AAAI 2019] Weakly-Supervised Hierarchical Text Classification☆85Dec 11, 2021Updated 4 years ago
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".☆23Sep 19, 2024Updated last year
- Source code for "SimCKP: Simple Contrastive Learning of Keyphrase Representations", Findings of EMNLP 2023☆12Jun 20, 2025Updated 9 months ago
- PyTorch Implementation of Autoencoding Variational Inference for Topic Models (Srivastava and Sutton 2017)☆39Oct 1, 2019Updated 6 years ago
- code and data for paper "GIANT: Scalable Creation of a Web-scale Ontology"☆39Apr 22, 2020Updated 5 years ago
- ☆13Sep 2, 2021Updated 4 years ago
- ☆32Jul 10, 2023Updated 2 years ago
- [NeurIPS 2019] Spherical Text Embedding☆184Oct 29, 2023Updated 2 years ago
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…☆18Mar 25, 2024Updated 2 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Retrieval as Attention☆81Dec 16, 2022Updated 3 years ago
- Hierarchical, multi-label topic modelling with LDA☆54Dec 8, 2022Updated 3 years ago
- Explainable Zero-Shot Topic Extraction☆65Aug 19, 2024Updated last year
- ☆18Jul 11, 2021Updated 4 years ago
- MATCH: Metadata-Aware Text Classification in A Large Hierarchy (WWW'21)☆118Apr 2, 2024Updated last year
- Codes for the EMNLP 2020 paper -- "FIND: Human-in-the-loop Debugging Deep Text Classifiers"☆18Nov 16, 2020Updated 5 years ago
- Official Implementation of Paper "Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling" (ICML 2023)☆10Jun 6, 2023Updated 2 years ago