[WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations
☆92Feb 10, 2022Updated 4 years ago
Alternatives and similar repositories for TopClus
Users that are interested in TopClus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds (NAACL'22)☆18Feb 18, 2025Updated last year
- [KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding☆57Feb 14, 2021Updated 5 years ago
- The source code used for paper "Effective Seed-Guided Topic Discovery by Integrating Multiple Types of Contexts", in WSDM 2023.☆14May 27, 2023Updated 2 years ago
- [WWW 2020] Discriminative Topic Mining via Category-Name Guided Text Embedding☆51Jan 6, 2021Updated 5 years ago
- The source code used for paper "Unsupervised Key Event Detection from Massive Text Corpora", published in KDD 2022.☆22Jul 15, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This project provides an unsupervised framework for mining and tagging quality phrases on text corpora with pretrained language models (K…☆173Feb 3, 2023Updated 3 years ago
- The source code used for paper "TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision…☆24Apr 6, 2025Updated last year
- Metadata-Induced Contrastive Learning for Zero-Shot Multi-Label Text Classification (WWW'22)☆32Jun 21, 2025Updated 9 months ago
- Code for paper OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision☆31May 9, 2022Updated 3 years ago
- Hierarchical Metadata-Aware Document Categorization under Weak Supervision (WSDM'21)☆45Apr 2, 2024Updated 2 years ago
- ☆16Jun 19, 2023Updated 2 years ago
- ☆27Nov 26, 2021Updated 4 years ago
- The source code used for paper "Empower Entity Set Expansion via Language Model Probing", published in ACL 2020.☆33Nov 3, 2020Updated 5 years ago
- Generate, Prune, Select: A Pipeline for Counterspeech Generation against Online Hate Speech (ACL-IJCNLP 2021 Findings)☆13Jun 22, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An easy-to-use tool for phrase encoding and topic mining (unsupervised aspect extraction); Code base for ACL 2022 paper, UCTopic: Unsuper…☆46Apr 25, 2023Updated 2 years ago
- Dataset, models, and code for paper "CiteSum: Citation Text-guided Scientific Extreme Summarization and Low-resource Domain Adaptation", …☆33Jul 8, 2022Updated 3 years ago
- OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)☆801Feb 20, 2026Updated last month
- The source code for SetExpan framework, published in ECML-PKDD 2017☆32Nov 22, 2021Updated 4 years ago
- [ACL 2024 Findings] This is the code for our paper "Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation wi…☆41Jun 23, 2024Updated last year
- ☆25Oct 27, 2020Updated 5 years ago
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding☆69Sep 18, 2022Updated 3 years ago
- Topic taxonomy completion with hierarchical discovery of novel topic clusters☆24Mar 7, 2022Updated 4 years ago
- [AAAI 2019] Weakly-Supervised Hierarchical Text Classification☆85Dec 11, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [AAAI 2023] This is the code for our paper `Neighborhood-Regularized Self-Training for Learning with Few Labels'.☆12Jan 11, 2023Updated 3 years ago
- A web interface to understand language-specific BERT-models☆18Apr 16, 2024Updated last year
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".☆24Sep 19, 2024Updated last year
- Source code for "SimCKP: Simple Contrastive Learning of Keyphrase Representations", Findings of EMNLP 2023☆12Jun 20, 2025Updated 9 months ago
- PyTorch Implementation of Autoencoding Variational Inference for Topic Models (Srivastava and Sutton 2017)☆39Oct 1, 2019Updated 6 years ago
- code and data for paper "GIANT: Scalable Creation of a Web-scale Ontology"☆39Apr 22, 2020Updated 5 years ago
- ☆32Jul 10, 2023Updated 2 years ago
- Code for ACL 2021 paper: Accelerating BERT Inference for Sequence Labeling via Early-Exit☆28Aug 19, 2022Updated 3 years ago
- Retrieval as Attention☆81Dec 16, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…☆18Mar 25, 2024Updated 2 years ago
- Explainable Zero-Shot Topic Extraction☆65Aug 19, 2024Updated last year
- Code for the paper "NetTaxo: Automated Topic Taxonomy Constructionfrom Text-Rich Network"☆32Feb 23, 2022Updated 4 years ago
- ☆18Jul 11, 2021Updated 4 years ago
- MATCH: Metadata-Aware Text Classification in A Large Hierarchy (WWW'21)☆118Apr 2, 2024Updated 2 years ago
- Codes for the EMNLP 2020 paper -- "FIND: Human-in-the-loop Debugging Deep Text Classifiers"☆18Nov 16, 2020Updated 5 years ago
- ☆35Mar 2, 2023Updated 3 years ago