jjcherian / conformal-safetyLinks

☆31

Alternatives and similar repositories for conformal-safety

Users that are interested in conformal-safety are comparing it to the libraries listed below

Sorting:

Varal7 / conformal-language-modeling
Conformal Language Modeling
☆31Updated last year
tatsu-lab / conformal-factual-lm
☆32Updated last year
YyzHarry / SubpopBench
[ICML 2023] Change is Hard: A Closer Look at Subpopulation Shift
☆108Updated 2 years ago
UCSB-NLP-Chang / llm_uncertainty
☆31Updated last year
Wuyxin / DISC
(ICML 2023) Discover and Cure: Concept-aware Mitigation of Spurious Correlation
☆41Updated last year
ykwon0407 / DataInf
DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)
☆71Updated 9 months ago
zlin7 / UQ-NLG
☆93Updated last year
rachtibat / LRP-eXplains-Transformers
Layer-Wise Relevance Propagation for Large Language Models and Vision Transformers [ICML 2024]
☆172Updated this week
MadryLab / trak
A fast, effective data attribution method for neural networks in PyTorch
☆212Updated 7 months ago
uvanlp / valda
A Python Data Valuation Package
☆31Updated 2 years ago
mertyg / post-hoc-cbm
Code for the paper "Post-hoc Concept Bottleneck Models". Spotlight @ ICLR 2023
☆79Updated last year
BigML-CS-UCLA / SpuCo
SpuCo is a Python package developed to further research to address spurious correlations.
☆24Updated 6 months ago
aengusl / spawrious
☆27Updated last year
huaxiuyao / Wild-Time
Benchmark for Natural Temporal Distribution Shift (NeurIPS 2022)
☆67Updated 2 years ago
Trustworthy-ML-Lab / Label-free-CBM
[ICLR 23] A new framework to transform any neural networks into an interpretable concept-bottleneck-model (CBM) without needing labeled c…
☆106Updated last year
zepingyu0512 / in-context-mechanism
code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…
☆12Updated 7 months ago
UW-Madison-Lee-Lab / LanguageInterfacedFineTuning
Code for Language-Interfaced FineTuning for Non-Language Machine Learning Tasks.
☆128Updated 8 months ago
adamxyang / laplace-lora
Bayesian low-rank adaptation for large language models
☆23Updated last year
opendataval / opendataval
OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)
☆99Updated 5 months ago
KihoPark / linear_rep_geometry
☆99Updated 5 months ago
batmanlab / ICML-2023-Route-interpret-repeat
[ICML 2023] Official repository of paper: Dividing and Conquering a BlackBox to a Mixture of Interpretable Models: Route, Interpret, Repe…
☆24Updated last year
mateoespinosa / cem
Repository for our NeurIPS 2022 paper "Concept Embedding Models: Beyond the Accuracy-Explainability Trade-Off" and our NeurIPS 2023 paper…
☆63Updated last month
YanNeu / spurious_imagenet
Spurious Features Everywhere - Large-Scale Detection of Harmful Spurious Features in ImageNet
☆32Updated last year
lorenzkuhn / semantic_uncertainty
☆171Updated last year
deeplearning-wisc / haloscope
source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"
☆47Updated 3 months ago
MiaoXiong2320 / llm-uncertainty
code repo for ICLR 2024 paper "Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs"
☆122Updated last year
ykwon0407 / dataoob
Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value (ICML 2023)
☆19Updated last year
Ybakman / TruthTorchLM
☆38Updated 3 months ago
AlexanderVNikitin / kernel-language-entropy
Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24)
☆24Updated 6 months ago
ZaydH / influence_analysis_papers
Influence Analysis and Estimation - Survey, Papers, and Taxonomy
☆79Updated last year