suzgunmirac / belief-in-the-machine
Belief in the Machine: Investigating Epistemological Blind Spots of Language Models
☆24 Updated 7 months ago
Alternatives and similar repositories for belief-in-the-machine
Users who are interested in belief-in-the-machine are comparing it to the libraries listed below.
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data ☆102 Updated 7 months ago
- Get answers to research questions from 200M+ papers. ☆207 Updated last month
- SciRepEval benchmark training and evaluation scripts ☆78 Updated 2 weeks ago
- Code and data for the paper 'The impact of founder personalities on startup success' ☆17 Updated 2 months ago
- A BERT-based application for reusable text classification at scale ☆38 Updated 2 years ago
- Robust and fast topic models with sentence-transformers. ☆83 Updated last week
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way ☆15 Updated last month
- [ICML 2025] HypotheSAEs: Hypothesizing interpretable relationships in text datasets using sparse autoencoders. https://arxiv.org/abs/2502… ☆65 Updated last month
- Generates and optimizes Haiku system and user prompts for classification ☆13 Updated last month
- A Fast, Adaptive, Stable, and Transferable Topic Model (NeurIPS 2024) ☆134 Updated 4 months ago
- HDBSCAN Tuning for BERTopic Models ☆49 Updated 2 years ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models. ☆222 Updated 2 years ago
- NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake… ☆45 Updated last year
- Dataset used to evaluate Skill Extraction systems based on the ESCO skills taxonomy. ☆17 Updated last year
- human_detectors hosts the data released from the paper "People who frequently use ChatGPT for writing tasks are accurate and robust detec… ☆40 Updated 7 months ago
- Lawma: A lightly fine-tuned Llama model for legal classification tasks. ☆25 Updated last year
- This repository contains an easy and intuitive approach to using SetFit in combination with spaCy. ☆80 Updated 2 years ago
- A collection of datasets for language model pretraining, including scripts for downloading, preprocessing, and sampling. ☆63 Updated last year
- Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-l… ☆143 Updated 6 months ago
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data ☆68 Updated last year
- Causal DAG Extraction from Text (DEFT) ☆66 Updated 11 months ago
- Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions. ☆250 Updated 10 months ago
- TopicGPT: A Prompt-Based Framework for Topic Modeling (NAACL'24) ☆369 Updated 8 months ago
- Official implementation of the ACL 2024 paper "Scientific Inspiration Machines Optimized for Novelty" ☆90 Updated last year
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️ ☆191 Updated 6 months ago
- Detecting Bias and ensuring Fairness in AI solutions ☆102 Updated 2 years ago