michelle123lam / lloom
Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-level concepts to analyze unstructured text.
☆58 · Updated last week
Related projects
Alternatives and complementary repositories for lloom
- Official Implementation of TopicGPT: A Prompt-Based Framework for Topic Modeling (NAACL '24) ☆217 · Updated this week
- ☆82 · Updated 5 months ago
- HDBSCAN Tuning for BERTopic Models ☆42 · Updated last year
- Package to extract connotation frames ☆79 · Updated 10 months ago
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data ☆77 · Updated 3 months ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️ ☆146 · Updated 4 months ago
- Code accompanying "How I learned to start worrying about prompt formatting". ☆92 · Updated last month
- The Prism Alignment Project ☆37 · Updated 6 months ago
- ☆86 · Updated 5 months ago
- ☆94 · Updated 6 months ago
- A BERT-based application for reusable text classification at scale ☆37 · Updated last year
- ☆196 · Updated 2 weeks ago
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod… ☆29 · Updated 8 months ago
- ☆20 · Updated last year
- SciRepEval benchmark training and evaluation scripts ☆67 · Updated 5 months ago
- Dataset repository for SDPROC Shared Task: Context24: Contextualizing Scientific Figures and Tables ☆18 · Updated 5 months ago
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes ☆182 · Updated last month
- ☆29 · Updated last year
- Notebooks for training universal 0-shot classifiers on many different tasks ☆104 · Updated 7 months ago
- The Harvard USPTO Patent Dataset ☆55 · Updated 10 months ago
- Get answers to research questions from 200M+ papers. ☆203 · Updated 10 months ago
- Guideline following Large Language Model for Information Extraction ☆309 · Updated 2 weeks ago
- ☆63 · Updated last month
- ☆62 · Updated 7 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆101 · Updated 5 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use? ☆131 · Updated 10 months ago
- Codes and Datasets for our ACL 2023 paper on cognitive reframing of negative thoughts ☆53 · Updated last year
- ☆208 · Updated 8 months ago
- The Synthetic-Persona-Chat dataset is a synthetically generated persona-based dialogue dataset. It extends the original Persona-Chat data… ☆76 · Updated 10 months ago
- potato: portable text annotation tool ☆296 · Updated 2 weeks ago