jjcherian / conformal-safety
☆20Updated 2 months ago
Alternatives and similar repositories for conformal-safety:
Users that are interested in conformal-safety are comparing it to the libraries listed below
- Conformal Language Modeling☆28Updated last year
- Discover and Cure: Concept-aware Mitigation of Spurious Correlation (ICML 2023)☆40Updated 10 months ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆62Updated 4 months ago
- [NeurIPS 23] Characterizing OOD Error via Optimal Transport☆13Updated last year
- This repository contains the implementation of Concept Activation Regions, a new framework to explain deep neural networks with human con…☆11Updated 2 years ago
- Benchmark for Natural Temporal Distribution Shift (NeurIPS 2022)☆65Updated last year
- Code for paper: Are Large Language Models Post Hoc Explainers?☆30Updated 6 months ago
- [ICML 2023] Change is Hard: A Closer Look at Subpopulation Shift☆105Updated last year
- ☆26Updated last year
- [ICML'24] Conformal Prediction for Deep Classifier via Label Ranking☆12Updated 8 months ago
- Code for paper: Aligning Large Language Models with Representation Editing: A Control Perspective☆24Updated 3 weeks ago
- Official repository of ICML 2023 paper: Dividing and Conquering a BlackBox to a Mixture of Interpretable Models: Route, Interpret, Repeat☆23Updated 11 months ago
- Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24)☆16Updated 2 months ago
- Spurious Features Everywhere - Large-Scale Detection of Harmful Spurious Features in ImageNet☆30Updated last year
- A reproduced PyTorch implementation of the Adversarially Reweighted Learning (ARL) model, originally presented in "Fairness without Demog…☆21Updated 4 years ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆39Updated 3 months ago
- Implementation of Concept-level Debugging of Part-Prototype Networks☆11Updated last year
- SpuCo is a Python package developed to further research to address spurious correlations.☆24Updated last month
- Code for the paper "A Whac-A-Mole Dilemma Shortcuts Come in Multiples Where Mitigating One Amplifies Others"☆47Updated 7 months ago
- A fast, effective data attribution method for neural networks in PyTorch☆192Updated 3 months ago
- Unofficial implementation of Conformal Language Modeling by Quach et al☆29Updated last year
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆69Updated 11 months ago
- Implementation of PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆32Updated 3 months ago
- A toolkit for quantitative evaluation of data attribution methods.☆39Updated this week
- Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value (ICML 2023)☆18Updated last year
- ☆44Updated 2 years ago
- Efficient empirical NTKs in PyTorch☆18Updated 2 years ago
- ☆41Updated 2 weeks ago
- ☆16Updated last year
- source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"☆31Updated last month