hartvigsen-group / composable-interventionsLinks
☆29Updated 8 months ago
Alternatives and similar repositories for composable-interventions
Users that are interested in composable-interventions are comparing it to the libraries listed below
Sorting:
- ☆16Updated last year
- Exploration of automated dataset selection approaches at large scales.☆48Updated 8 months ago
- Lightweight Adapting for Black-Box Large Language Models☆24Updated last year
- ☆51Updated last year
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆40Updated last year
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆141Updated 4 months ago
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆33Updated 8 months ago
- ☆40Updated last year
- [ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models☆59Updated last year
- ☆41Updated 2 years ago
- Confidence Regulation Neurons in Language Models (NeurIPS 2024)☆14Updated 9 months ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆32Updated last year
- Test-time-training on nearest neighbors for large language models☆46Updated last year
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆82Updated 10 months ago
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning☆20Updated last month
- Recycling diverse models☆46Updated 2 years ago
- ☆52Updated 7 months ago
- [EMNLP 2024] A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners☆26Updated 11 months ago
- ☆94Updated last year
- Optimize Any User-defined Compound AI Systems☆62Updated 3 months ago
- ☆32Updated 9 months ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Updated 2 years ago
- Sparse and discrete interpretability tool for neural networks☆64Updated last year
- [ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models☆22Updated last year
- ☆20Updated 2 weeks ago
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers☆159Updated this week
- ☆52Updated 8 months ago
- Learning adapter weights from task descriptions☆19Updated 2 years ago
- ☆33Updated 10 months ago
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…☆23Updated last year