LMU-Seminar-LLMs / TopicGPT
TopicGPT integrates the benefits of LLMs into topic modelling.
☆22 Updated 8 months ago
Alternatives and similar repositories for TopicGPT:
Users who are interested in TopicGPT compare it to the libraries listed below:
- ☆18 Updated last year
- Scripts to evaluate various bias metrics for different NLG models + decoding algorithms ☆16 Updated last year
- ☆70 Updated 5 months ago
- Aligned Neural Topic Model (ANTM) for Exploring Evolving Topics: a dynamic neural topic model that uses document embeddings (data2vec) to… ☆35 Updated last year
- A collection of topic diversity measures for topic modeling ☆45 Updated 3 years ago
- M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection ☆22 Updated 11 months ago
- ☆33 Updated 4 years ago
- [WWW 2020] Discriminative Topic Mining via Category-Name Guided Text Embedding ☆50 Updated 4 years ago
- A package to run embedded topic modelling with ETM. Adapted from the original at: https://github.com/adjidieng/ETM ☆95 Updated last year
- The Harvard USPTO Patent Dataset ☆64 Updated last year
- 🕸️ A graph-augmented dense statute retriever. (EACL 2023) ☆21 Updated last year
- ☆32 Updated last year
- This repository contains the paperlist of CSS. ☆26 Updated 2 years ago
- [WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations ☆87 Updated 3 years ago
- ☆52 Updated last year
- Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds (NAACL'22) ☆17 Updated 2 weeks ago
- ☆15 Updated 3 years ago
- ☆42 Updated 9 months ago
- Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation. ☆88 Updated last year
- Code for Everything Has a Cause: Leveraging Causal Inference in Legal Text Analysis (NAACL 2021 oral paper) ☆66 Updated 2 years ago
- ☆38 Updated 2 months ago
- Topic taxonomy completion with hierarchical discovery of novel topic clusters ☆24 Updated 3 years ago
- HDBSCAN Tuning for BERTopic Models ☆44 Updated last year
- Code and dataset for the EMNLP paper "Instruct and Extract: Instruction Tuning for On-Demand Information Extraction" ☆49 Updated last year
- Dataset and code for "Explainable Automated Fact-Checking for Public Health Claims" from EMNLP 2020. ☆58 Updated 3 years ago
- NoMIRACL: A multilingual hallucination evaluation dataset to evaluate LLM robustness in RAG against first-stage retrieval errors on 18 la… ☆21 Updated 3 months ago
- ☆17 Updated 2 years ago
- ☆17 Updated 3 years ago
- Repository contains demo code for MTAnchor, an interactive, multilingual topic modeling system. The code accompanies the paper Multiling… ☆12 Updated 6 years ago
- Implementation of the paper "FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations (NAACL 2022)" ☆47 Updated last year