jjcherian / conformal-safety
☆31 Updated 7 months ago
Alternatives and similar repositories for conformal-safety
Users who are interested in conformal-safety are comparing it to the libraries listed below.
- Conformal Language Modeling ☆31 Updated last year
- ☆32 Updated last year
- [ICML 2023] Change is Hard: A Closer Look at Subpopulation Shift ☆108 Updated 2 years ago
- ☆31 Updated last year
- (ICML 2023) Discover and Cure: Concept-aware Mitigation of Spurious Correlation ☆41 Updated last year
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024) ☆71 Updated 9 months ago
- ☆93 Updated last year
- Layer-Wise Relevance Propagation for Large Language Models and Vision Transformers [ICML 2024] ☆172 Updated this week
- A fast, effective data attribution method for neural networks in PyTorch ☆212 Updated 7 months ago
- A Python Data Valuation Package ☆31 Updated 2 years ago
- Code for the paper "Post-hoc Concept Bottleneck Models". Spotlight @ ICLR 2023 ☆79 Updated last year
- SpuCo is a Python package developed to further research to address spurious correlations. ☆24 Updated 6 months ago
- ☆27 Updated last year
- Benchmark for Natural Temporal Distribution Shift (NeurIPS 2022) ☆67 Updated 2 years ago
- [ICLR 23] A new framework to transform any neural networks into an interpretable concept-bottleneck-model (CBM) without needing labeled c… ☆106 Updated last year
- code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M… ☆12 Updated 7 months ago
- Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks. ☆128 Updated 8 months ago
- Bayesian low-rank adaptation for large language models ☆23 Updated last year
- OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023) ☆99 Updated 5 months ago
- ☆99 Updated 5 months ago
- [ICML 2023] Official repository of paper: Dividing and Conquering a BlackBox to a Mixture of Interpretable Models: Route, Interpret, Repe… ☆24 Updated last year
- Repository for our NeurIPS 2022 paper "Concept Embedding Models: Beyond the Accuracy-Explainability Trade-Off" and our NeurIPS 2023 paper… ☆63 Updated last month
- Spurious Features Everywhere - Large-Scale Detection of Harmful Spurious Features in ImageNet ☆32 Updated last year
- ☆171 Updated last year
- source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection" ☆47 Updated 3 months ago
- code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs" ☆122 Updated last year
- Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value (ICML 2023) ☆19 Updated last year
- ☆38 Updated 3 months ago
- Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24) ☆24 Updated 6 months ago
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy ☆79 Updated last year