lisadunlap / VibeCheck
Official Implementation of "VibeCheck: Discover and Quantify Qualitative Differences in Large Language Models"
☆25Updated last week
Alternatives and similar repositories for VibeCheck:
Users that are interested in VibeCheck are comparing it to the libraries listed below
- ☆13Updated last month
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆22Updated 2 months ago
- ☆39Updated 5 months ago
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆35Updated last year
- gzip Predicts Data-dependent Scaling Laws☆33Updated 7 months ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks"