hartvigsen-group / composable-interventions

☆29

Alternatives and similar repositories for composable-interventions

Users who are interested in composable-interventions are comparing it to the repositories listed below.

jiahai-feng / binding-iclr
☆16 Updated last year
hamishivi / automated-instruction-selection
Exploration of automated dataset selection approaches at large scales.
☆48 Updated 8 months ago
haotiansun14 / BBox-Adapter
Lightweight Adapting for Black-Box Large Language Models
☆24 Updated last year
MadryLab / DsDm
☆51 Updated last year
peterljq / Parsimonious-Concept-Engineering
PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)
☆40 Updated last year
stanfordnlp / axbench
Stanford NLP Python library for benchmarking the utility of LLM interpretability methods
☆141 Updated 4 months ago
zzwjames / FailureLLMUnlearning
An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)
☆33 Updated 8 months ago
UCSB-NLP-Chang / llm_uncertainty
☆40 Updated last year
jinhaoduan / SAR
[ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models
☆59 Updated last year
tatsu-lab / test_set_contamination
☆41 Updated 2 years ago
bpwu1 / confidence-regulation-neurons
Confidence Regulation Neurons in Language Models (NeurIPS 2024)
☆14 Updated 9 months ago
maszhongming / ParaKnowTransfer
Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"
☆32 Updated last year
socialfoundations / tttlm
Test-time-training on nearest neighbors for large language models
☆46 Updated last year
Thartvigsen / GRACE
[NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors
☆82 Updated 10 months ago
tianyi-lab / Mosaic-IT
[ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning
☆20 Updated last month
facebookresearch / ModelRatatouille
Recycling diverse models
☆46 Updated 2 years ago
activatedgeek / calibration-tuning
☆52 Updated 7 months ago
bowen-upenn / llm_token_bias
[EMNLP 2024] A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners
☆26 Updated 11 months ago
saprmarks / geometry-of-truth
☆94 Updated last year
snap-stanford / optimas
Optimize Any User-defined Compound AI Systems
☆62 Updated 3 months ago
stellalisy / mediQ
☆32 Updated 9 months ago
ylsung / vl-merging
PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"
☆37 Updated 2 years ago
taufeeque9 / codebook-features
Sparse and discrete interpretability tool for neural networks
☆64 Updated last year
zjunlp / PitfallsKnowledgeEditing
[ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models
☆22 Updated last year
formll / resolving-scaling-law-discrepancies
☆20 Updated 2 weeks ago
zjunlp / KnowledgeCircuits
[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers
☆159 Updated this week
probabilistic-inference-scaling / probabilistic-inference-scaling
☆52 Updated 8 months ago
allenai / hyper-task-descriptions
Learning adapter weights from task descriptions
☆19 Updated 2 years ago
katiekang1998 / reasoning_generalization
☆33 Updated 10 months ago
vedantpalit / Towards-Vision-Language-Mechanistic-Interpretability
This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…
☆23 Updated last year