Varal7 / conformal-language-modelingLinks

Conformal Language Modeling

☆31

Alternatives and similar repositories for conformal-language-modeling

Users that are interested in conformal-language-modeling are comparing it to the libraries listed below

Sorting:

tatsu-lab / conformal-factual-lm
☆32Updated last year
zlin7 / UQ-NLG
☆97Updated last year
ykwon0407 / DataInf
DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)
☆73Updated 10 months ago
balevinstein / Probes
☆52Updated 2 years ago
UCSB-NLP-Chang / llm_uncertainty
☆32Updated last year
ZaydH / influence_analysis_papers
Influence Analysis and Estimation - Survey, Papers, and Taxonomy
☆80Updated last year
pomonam / kronfluence
Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature
☆157Updated last month
MadryLab / trak
A fast, effective data attribution method for neural networks in PyTorch
☆214Updated 8 months ago
jjcherian / conformal-safety
☆31Updated 8 months ago
TRAIS-Lab / dattri
`dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.
☆81Updated last month
p-lambda / incontext-learning
Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici…
☆108Updated last year
adamxyang / laplace-lora
Bayesian low-rank adaptation for large language models
☆23Updated last year
dtsip / in-context-learning
☆234Updated last year
opendataval / opendataval
OpenDataVal: a Unified Benchmark for Data Valuation in Python (NeurIPS 2023)
☆99Updated 6 months ago
kawine / dataset_difficulty
"Understanding Dataset Difficulty with V-Usable Information" (ICML 2022, outstanding paper)
☆87Updated last year
tatsu-lab / linguistic_calibration
Align your LM to express calibrated verbal statements of confidence in its long-form generations.
☆27Updated last year
explanare / ravel
Evaluate interpretability methods on localizing and disentangling concepts in LLMs.
☆52Updated 10 months ago
ajyl / dpo_toxic
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
☆74Updated 4 months ago
alstonlo / torch-influence
A simple PyTorch implementation of influence functions.
☆89Updated last year
logix-project / logix
AI Logging for Interpretability and Explainability🔬
☆125Updated last year
lorenzkuhn / semantic_uncertainty
☆171Updated last year
Thartvigsen / GRACE
[NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors
☆78Updated 7 months ago
deeplearning-wisc / args
☆43Updated last year
causalNLP / corr2cause
Data and code for the Corr2Cause paper (ICLR 2024)
☆108Updated last year
IBM / activation-steering
[ICLR 2025] General-purpose activation steering library
☆87Updated last week
zepingyu0512 / in-context-mechanism
code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…
☆13Updated 8 months ago
adamkarvonen / SAEBench
☆109Updated 3 weeks ago
DeqingFu / transformers-icl-second-order
Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode…
☆17Updated 8 months ago
KihoPark / linear_rep_geometry
☆103Updated 5 months ago
pratyushmaini / localizing-memorization
Official Repository for ICML 2023 paper "Can Neural Network Memorization Be Localized?"
☆19Updated last year