parameterlab / apricotLinks
Source code of "Calibrating Large Language Models Using Their Generations Only", ACL2024
☆17Updated 7 months ago
Alternatives and similar repositories for apricot
Users that are interested in apricot are comparing it to the libraries listed below
Sorting:
- ☆44Updated 4 months ago
- ☆35Updated 6 months ago
- ☆44Updated last year
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]☆17Updated last year
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆26Updated last year
- ☆30Updated 11 months ago
- Data Valuation without Training of a Model, submitted to ICLR'23☆22Updated 2 years ago
- ☆69Updated 3 years ago
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16Updated 2 years ago
- Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning"☆17Updated last year
- ☆54Updated 2 years ago
- About Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23)☆15Updated last year
- ConceptVectors Benchmark and Code for the paper "Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces"☆35Updated 4 months ago
- Code for "Universal Adversarial Triggers Are Not Universal."☆17Updated last year
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning☆19Updated this week
- ☆28Updated 3 months ago
- ☆22Updated 11 months ago
- Restore safety in fine-tuned language models through task arithmetic☆28Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆30Updated 5 months ago
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆77Updated 6 months ago
- [ICLR 2025] Official Repository for "Tamper-Resistant Safeguards for Open-Weight LLMs"☆58Updated 2 weeks ago
- ☆44Updated 3 months ago
- source code for NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection"☆46Updated 2 months ago
- Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs☆36Updated last year
- ☆14Updated last year
- ☆21Updated last year
- ☆40Updated last year
- Official PyTorch implementation of "Neural Relation Graph: A Unified Framework for Identifying Label Noise and Outlier Data" (NeurIPS'23)☆15Updated last year
- ☆14Updated last year
- Test-time-training on nearest neighbors for large language models☆41Updated last year