Wang-ML-Lab / interpretable-foundation-modelsLinks
[ICML 2024] Probabilistic Conceptual Explainers (PACE): Trustworthy Conceptual Explanations for Vision Foundation Models
☆16Updated 2 months ago
Alternatives and similar repositories for interpretable-foundation-models
Users that are interested in interpretable-foundation-models are comparing it to the libraries listed below
Sorting:
- Benchmark for Natural Temporal Distribution Shift (NeurIPS 2022)☆67Updated 2 years ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆88Updated 10 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆67Updated 6 months ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆35Updated 4 months ago
- Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective☆32Updated 7 months ago
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"☆37Updated last month
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆49Updated 10 months ago
- official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…☆19Updated last month
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆104Updated 2 years ago
- What Makes a Reward Model a Good Teacher? An Optimization Perspective☆35Updated 2 months ago
- Bayesian Low-Rank Adaptation for Large Language Models☆35Updated last year
- Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples☆43Updated last month
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆29Updated last year
- Paper of out of distribution detection and generalization☆56Updated 2 years ago
- ☆73Updated 3 years ago
- [NeurIPS 2021] A Geometric Analysis of Neural Collapse with Unconstrained Features☆58Updated 3 years ago
- ☆67Updated 9 months ago
- ☆34Updated last year
- Code for "Surgical Fine-Tuning Improves Adaptation to Distribution Shifts" published at ICLR 2023☆29Updated 2 years ago
- Bayesian low-rank adaptation for large language models☆23Updated last year
- [ICLR 2023, ICLR DG oral] PAIR, the optimizer and model selection criteria for OOD Generalization☆52Updated last year
- ☆21Updated last year
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆29Updated 7 months ago
- This is the project for IRM methods☆13Updated 3 years ago
- Official code for the paper: DRA-GRPO: Exploring Diversity-Aware Reward Adjustment for R1-Zero-Like Training of Large Language Models☆20Updated 2 months ago
- [ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond"☆12Updated 6 months ago
- (ICML 2023) Discover and Cure: Concept-aware Mitigation of Spurious Correlation☆41Updated last year
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆21Updated 11 months ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆35Updated last year
- ☆35Updated last year