lisadunlap / VibeCheckLinks
Automated Qualitative Analysis of LLMs (ICLR 2025)
☆52Updated 7 months ago
Alternatives and similar repositories for VibeCheck
Users that are interested in VibeCheck are comparing it to the libraries listed below
Sorting:
- ☆59Updated last year
- Leveraging Base Language Models for Few-Shot Synthetic Data Generation☆40Updated 3 months ago
- Training Proactive and Personalized LLM Agents☆98Updated 2 weeks ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆69Updated last year
- Aioli: A unified optimization framework for language model data mixing☆32Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆61Updated last year
- Official Repo for CRMArena and CRMArena-Pro☆132Updated this week
- ☆91Updated last month
- ☆53Updated 11 months ago
- ☆49Updated 10 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆112Updated 9 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆35Updated 9 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated last year
- The first dense retrieval model that can be prompted like an LM☆90Updated 9 months ago
- An attribution library for LLMs☆46Updated last year
- ☆44Updated last year