jjcherian / conformal-safety
☆32 Updated 11 months ago
Alternatives and similar repositories for conformal-safety
Users who are interested in conformal-safety are comparing it to the libraries listed below.
- Conformal Language Modeling ☆32 Updated last year
- ☆33 Updated last year
- ☆40 Updated last year
- ☆241 Updated last year
- Layer-wise Relevance Propagation for Large Language Models and Vision Transformers [ICML 2024] ☆203 Updated 4 months ago
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024) ☆75 Updated last year
- A fast, effective data attribution method for neural networks in PyTorch ☆220 Updated last year
- ☆102 Updated last year
- [ICML 2023] Change is Hard: A Closer Look at Subpopulation Shift ☆111 Updated 2 years ago
- ☆136 Updated this week
- ☆22 Updated last year
- ☆110 Updated 9 months ago
- (ICML 2023) Discover and Cure: Concept-aware Mitigation of Spurious Correlation ☆42 Updated this week
- OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023) ☆99 Updated 9 months ago
- Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value (ICML 2023) ☆20 Updated 2 years ago
- Official Implementation of the paper: "A Rate-Distortion View of Uncertainty Quantification", ICML 2024 ☆28 Updated last year
- A Python Data Valuation Package ☆30 Updated 2 years ago
- Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks. ☆132 Updated last year
- SpuCo is a Python package developed to further research to address spurious correlations. ☆24 Updated 10 months ago
- MedSafetyBench: Evaluating and Improving the Medical Safety of LLMs, NeurIPS 2024 ☆34 Updated 4 months ago
- Using sparse coding to find distributed representations used by neural networks. ☆283 Updated 2 years ago
- Spurious Features Everywhere - Large-Scale Detection of Harmful Spurious Features in ImageNet ☆32 Updated 2 years ago
- A package for conformal prediction with conditional guarantees. ☆67 Updated last month
- [ICML 2023] Official repository of paper: Dividing and Conquering a BlackBox to a Mixture of Interpretable Models: Route, Interpret, Repe… ☆25 Updated 3 months ago
- ☆53 Updated 10 months ago
- ☆24 Updated 7 months ago
- Extending Conformal Prediction to LLMs ☆68 Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models". ☆107 Updated 2 years ago
- Benchmark for Natural Temporal Distribution Shift (NeurIPS 2022) ☆68 Updated 2 years ago
- Code for paper: Are Large Language Models Post Hoc Explainers? ☆34 Updated last year