parameterlab / apricotLinks
Source code of "Calibrating Large Language Models Using Their Generations Only", ACL2024
☆22Updated last year
Alternatives and similar repositories for apricot
Users that are interested in apricot are comparing it to the libraries listed below
Sorting:
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16Updated 2 years ago
- Data Valuation without Training of a Model, submitted to ICLR'23☆22Updated 3 years ago
- ☆33Updated 2 months ago
- ☆80Updated 3 years ago
- source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"☆64Updated 9 months ago
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]☆20Updated last year
- ☆29Updated 3 years ago
- ☆32Updated last year
- code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…☆13Updated last year
- ☆32Updated 2 years ago
- Restore safety in fine-tuned language models through task arithmetic☆31Updated last year
- Spurious Features Everywhere - Large-Scale Detection of Harmful Spurious Features in ImageNet☆32Updated 2 years ago
- ☆41Updated last year
- ☆51Updated 2 years ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆108Updated 2 years ago
- ☆103Updated last year
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆29Updated last year
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆83Updated last year
- Code release for Dataless Knowledge Fusion by Merging Weights of Language Models (https://openreview.net/forum?id=FCnohuR6AnM)☆92Updated 2 years ago
- Code for the paper "A Whac-A-Mole Dilemma Shortcuts Come in Multiples Where Mitigating One Amplifies Others"☆51Updated last year
- ☆43Updated 2 years ago
- ☆25Updated 2 years ago
- Test-time-training on nearest neighbors for large language models☆49Updated last year
- ☆53Updated 9 months ago
- Residual Prompt Tuning: a method for faster and better prompt tuning.☆57Updated 2 years ago
- ☆46Updated last year
- AI Logging for Interpretability and Explainability🔬☆138Updated last year
- [EMNLP 2024] "Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective"☆32Updated last year
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆79Updated last year
- ☆37Updated last year