lisadunlap/VibeCheck

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lisadunlap/VibeCheck)

lisadunlap / VibeCheck

Automated Qualitative Analysis of LLMs (ICLR 2025)

☆53

Alternatives and similar repositories for VibeCheck

Users that are interested in VibeCheck are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yuhui-zh15 / C3
View on GitHub
Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)
☆36Oct 16, 2024Updated last year
adamobeng / schemagen
View on GitHub
Make tool-calling schemas for existing tools
☆14Mar 8, 2025Updated last year
xAlg-ai / HashAttention-1.0
View on GitHub
☆18Sep 23, 2025Updated 9 months ago
SarahXC / customer-research-tutorial
View on GitHub
Automatically research and outbound companies with Exa API and google sheets app scripts.
☆18Jun 24, 2024Updated 2 years ago
yeung-lab / Micro-Bench
View on GitHub
A Vision-Language Benchmark for Microscopy Understanding
☆31Mar 13, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yuhui-zh15 / AutoConverter
View on GitHub
Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…
☆40May 26, 2025Updated last year
yale-nlp / InstruSum
View on GitHub
☆23Feb 26, 2024Updated 2 years ago
aikunyi / FSTN
View on GitHub
Fourier Spatial-Temporal Network for Multivariate Time Series Forecasting
☆11Jan 1, 2023Updated 3 years ago
amazon-science / text_generation_diffusion_llm_topic
View on GitHub
Topic Embedding, Text Generation and Modeling using diffusion
☆15Jun 10, 2026Updated last month
zenghy96 / Reliable-Source-Approximation
View on GitHub
Reliable Source Approximation: Source-Free Domain Adaptation for Vestibular Schwannoma MRI Segmentation
☆11Dec 28, 2024Updated last year
zimufengyan / MMPN-FD
View on GitHub
The source code (Pytorch version) of paper "Multi-modality augmented Prototypical Network for Fault Diagnosis"
☆11Aug 26, 2024Updated last year
mchiquier / llm-mutate
View on GitHub
☆15Oct 7, 2024Updated last year
koaning / datasette-marimo
View on GitHub
Adding Marimo to Datasette
☆21Mar 24, 2025Updated last year
OneBugMaker / DLCNN
View on GitHub
1 提出了一种新的相似度损失(SimL)，用于增大类间差异同时减小类内差异；2 SimL+CE优化CNN；3 基于电信号诊断轴承故障
☆14Jun 4, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ottiram / WOMBAT
View on GitHub
The Word Embedding Database API
☆11Aug 20, 2019Updated 6 years ago
Nanne / ProtoSim
View on GitHub
Code and instructions accompanying ICCV'23 paper Protoype-based Dataset Comparison
☆18Dec 15, 2023Updated 2 years ago
az1326 / advisor-models
View on GitHub
How to Train Your Advisor: Steering Black-Box LLMs with Advisor Models
☆81Feb 5, 2026Updated 5 months ago
cicl-stanford / moca
View on GitHub
Language model evaluation for morality and causality
☆20Nov 14, 2023Updated 2 years ago
Wangrui-berry / Cross-attention
View on GitHub
Cross-Attention Guided Loss-Based Deep Dual-Branch Fusion Network for Liver Tumor Classification
☆16Sep 26, 2024Updated last year
mustafamariam / LLM-Connections-Solver
View on GitHub
Code for Columbia University COMS 3997 – LLM Ethics and Foundations
☆16Jan 7, 2025Updated last year
dwillis / LLM-Extraction-Challenge
View on GitHub
☆23Dec 9, 2025Updated 7 months ago
model-similarity / lm-similarity
View on GitHub
☆21Feb 10, 2025Updated last year
TransluceAI / docent
View on GitHub
☆114Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
orrzohar / Video-STaR
View on GitHub
[ICLR 2025] Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision
☆72Jul 10, 2024Updated 2 years ago
tkuhn / aida
View on GitHub
AIDA Sentences
☆20Feb 19, 2021Updated 5 years ago
facebookresearch / threadweaver
View on GitHub
The implementation for ThreadWeaver Adaptive Threading for Efficient Parallel Reasoning in Language Models
☆67Apr 8, 2026Updated 3 months ago
Yale-LILY / ROSE
View on GitHub
☆41Jun 7, 2023Updated 3 years ago
rycolab / rnn-turing-completeness
View on GitHub
☆15Oct 21, 2023Updated 2 years ago
songys / 2021Langcon
View on GitHub
☆11Oct 3, 2021Updated 4 years ago
romilbhardwaj / kube-tutorial
View on GitHub
Kubernetes Tutorial for the PS2 group meetings at UC Berkeley
☆17Mar 23, 2023Updated 3 years ago
CQU-ZixuChen / MSGCN-CSP
View on GitHub
The source code of the model: Multi-Scale Graph Convolutional Network with Contrastive-Learning enhanced Self-attention Pooling (MSGCN-CS…
☆18May 12, 2026Updated 2 months ago
ko-nlp / moducorpus-sanitizer
View on GitHub
모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.
☆11Mar 2, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
rmovva / wimhf
View on GitHub
What's In My Human Feedback? Explaining preferences in human feedback using interpretability + LLMs. https://arxiv.org/abs/2510.26202
☆26May 9, 2026Updated 2 months ago
Cloudfly-Z / ACAN
View on GitHub
ACAN: A Plug-and-Play Adaptive Center-Aligned Network for Unsupervised Domain Adaptation
☆21Sep 12, 2024Updated last year
OpenBMB / ConsJudge
View on GitHub
☆18Mar 23, 2025Updated last year
lava-security-research / forge-framework
View on GitHub
Top 10 Data Centers & AI Infrastructure Security Risks
☆16Updated this week
JethroJames / TUNED
View on GitHub
[AAAI 2025] Trusted Unified Feature-Neighborhood Dynamics for Multi-View Classification
☆20Apr 17, 2025Updated last year
BoHuangLab / CELL-E_2
View on GitHub
Multimodal encoder-only transformer model for image-based protein predictions
☆15Dec 12, 2023Updated 2 years ago
GAIR-NLP / Preference-Dissection
View on GitHub
☆25May 16, 2024Updated 2 years ago