Wang-ML-Lab / interpretable-foundation-modelsLinks

[ICML 2024] Probabilistic Conceptual Explainers (PACE): Trustworthy Conceptual Explanations for Vision Foundation Models

☆16

Alternatives and similar repositories for interpretable-foundation-models

Users that are interested in interpretable-foundation-models are comparing it to the libraries listed below

Sorting:

huaxiuyao / Wild-Time
Benchmark for Natural Temporal Distribution Shift (NeurIPS 2022)
☆67Updated 2 years ago
EnnengYang / AdaMerging
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
☆87Updated 8 months ago
YefanZhou / TempBalance
[NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training
☆35Updated 3 months ago
tmlr-group / NoisyRationales
[NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"
☆35Updated 6 months ago
YyzHarry / SubpopBench
[ICML 2023] Change is Hard: A Closer Look at Subpopulation Shift
☆108Updated 2 years ago
MaximeRobeyns / bayesian_lora
Bayesian Low-Rank Adaptation for Large Language Models
☆34Updated last year
nik-dim / tall_masks
Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
☆45Updated 8 months ago
miniHuiHui / awesome-out-of-distribution-detection
Paper of out of distribution detection and generalization
☆56Updated last year
anniesch / surgical-finetuning
Code for "Surgical Fine-Tuning Improves Adaptation to Distribution Shifts" published at ICLR 2023
☆29Updated 2 years ago
gortizji / tangent_task_arithmetic
Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".
☆102Updated 2 years ago
Aboriginer / EOE
[ICML 2024] "Envisioning Outlier Exposure by Large Language Models for Out-of-Distribution Detection"
☆15Updated 5 months ago
aengusl / spawrious
☆27Updated last year
tmlr-group / G-effect
[ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond"
☆11Updated 4 months ago
siyan-zhao / ICL_decision_boundary
official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…
☆19Updated 10 months ago
harveyhuang18 / EMR_Merging
[NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging
☆59Updated 4 months ago
clemneo / llava-interp
☆57Updated 8 months ago
deeplearning-wisc / scone
☆9Updated last year
skzhang1 / IDEAL
IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models
☆59Updated last year
BigML-CS-UCLA / SpuCo
SpuCo is a Python package developed to further research to address spurious correlations.
☆24Updated 6 months ago
Lingkai-Kong / RE-Control
Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective
☆32Updated 5 months ago
deeplearning-wisc / cider
PyTorch implementation of CIDER (How to exploit hyperspherical embeddings for out-of-distribution detection), ICLR 2023
☆61Updated last year
Trustworthy-ML-Lab / Label-free-CBM
[ICLR 23] A new framework to transform any neural networks into an interpretable concept-bottleneck-model (CBM) without needing labeled c…
☆107Updated last year
tmlr-group / CoPA
[NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"
☆11Updated 8 months ago
Wuyxin / DISC
(ICML 2023) Discover and Cure: Concept-aware Mitigation of Spurious Correlation
☆41Updated last year
tanganke / weight-ensembling_MoE
Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"
☆27Updated last year
ZFancy / DivOE
[NeurIPS 2023] "Diversified Outlier Exposure for Out-of-Distribution Detection via Informative Extrapolation"
☆11Updated last year
shenlei515 / VHL-paddle
translation of VHL repo in paddle
☆25Updated 2 years ago
warriors-30 / SFAT-paddle
☆24Updated 2 years ago
YuheD / awesome-model-transferability-estimation
A collection of model transferability estimation methods.
☆28Updated 9 months ago
hee-suk-yoon / C-TPT
[ICLR'24] Official code for "C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion"
☆17Updated last year