Wang-ML-Lab / interpretable-foundation-modelsLinks
[ICML 2024] Probabilistic Conceptual Explainers (PACE): Trustworthy Conceptual Explanations for Vision Foundation Models
☆18Updated last month
Alternatives and similar repositories for interpretable-foundation-models
Users that are interested in interpretable-foundation-models are comparing it to the libraries listed below
Sorting:
- Benchmark for Natural Temporal Distribution Shift (NeurIPS 2022)☆67Updated 2 years ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆94Updated last year
- Bayesian Low-Rank Adaptation for Large Language Models☆36Updated last year
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"☆37Updated 3 months ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆35Updated 7 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆51Updated last year
- [ICML 2025] No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces (official repository)☆28Updated 3 months ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆105Updated 2 years ago
- Bayesian low-rank adaptation for large language models☆26Updated last year
- [ICML 2023] Change is Hard: A Closer Look at Subpopulation Shift☆110Updated 2 years ago
- (ICML 2023) Discover and Cure: Concept-aware Mitigation of Spurious Correlation☆42Updated last year
- [ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond"☆13Updated 8 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆71Updated 8 months ago
- Code for 'CausalAdv: Adversarial Robustness Through the Lens of Causality'☆43Updated last year
- Paper of out of distribution detection and generalization☆56Updated 2 years ago
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).☆51Updated last year
- Repository for research works and resources related to model reprogramming <https://arxiv.org/abs/2202.10629>☆62Updated last month
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆30Updated last month
- Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective☆34Updated 9 months ago
- SpuCo is a Python package developed to further research to address spurious correlations.☆24Updated 9 months ago
- Code for "Surgical Fine-Tuning Improves Adaptation to Distribution Shifts" published at ICLR 2023☆29Updated 2 years ago
- official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…☆19Updated 3 months ago
- ☆78Updated 3 years ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆35Updated last year
- A Sober Look at Language Model Reasoning☆87Updated last month
- Implementaiton of "DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation" (accepted by NAACL2024 Findings)".☆24Updated 8 months ago
- ☆24Updated 2 years ago
- This repository contains the implementation of Concept Activation Regions, a new framework to explain deep neural networks with human con…☆14Updated 3 years ago
- This is the project for IRM methods☆13Updated 4 years ago
- translation of VHL repo in paddle☆25Updated 2 years ago