jjcherian / conformal-safetyLinks
☆32Updated last year
Alternatives and similar repositories for conformal-safety
Users that are interested in conformal-safety are comparing it to the libraries listed below
Sorting:
- Conformal Language Modeling☆32Updated 2 years ago
- ☆33Updated last year
- Layer-wise Relevance Propagation for Large Language Models and Vision Transformers [ICML 2024]☆213Updated 5 months ago
- ☆40Updated last year
- A fast, effective data attribution method for neural networks in PyTorch☆224Updated last year
- OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)☆99Updated 11 months ago
- SpuCo is a Python package developed to further research to address spurious correlations.☆24Updated 11 months ago
- (ICML 2023) Discover and Cure: Concept-aware Mitigation of Spurious Correlation☆42Updated last month
- Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value (ICML 2023)☆21Updated 2 years ago
- [ICML 2023] Change is Hard: A Closer Look at Subpopulation Shift☆111Updated 2 years ago
- ☆23Updated last year
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆79Updated last year
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆84Updated last year
- A Python Data Valuation Package☆30Updated 2 years ago
- Trains Sparse Autoencoders based on outputs from language models☆11Updated last year
- ☆104Updated last year
- Spurious Features Everywhere - Large-Scale Detection of Harmful Spurious Features in ImageNet☆32Updated 2 years ago
- Benchmark for Natural Temporal Distribution Shift (NeurIPS 2022)☆68Updated 2 years ago
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆41Updated last year
- ☆24Updated 8 months ago
- [ICML 2023] Official repository of paper: Dividing and Conquering a BlackBox to a Mixture of Interpretable Models: Route, Interpret, Repe…☆25Updated 5 months ago
- A simple PyTorch implementation of influence functions.☆92Updated last year
- A repository for summaries of recent explainable AI/Interpretable ML approaches☆88Updated last year
- source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"☆64Updated 8 months ago
- Extending Conformal Prediction to LLMs☆68Updated last year
- Source code of "Calibrating Large Language Models Using Their Generations Only", ACL2024☆22Updated last year
- Repository for our NeurIPS 2022 paper "Concept Embedding Models", our NeurIPS 2023 paper "Learning to Receive Help", and our ICML 2025 pa…☆72Updated 2 months ago
- ☆241Updated last year
- ☆140Updated last week
- A package for conformal prediction with conditional guarantees.☆67Updated 3 months ago