UCSB-NLP-Chang / llm_uncertainty
☆26Updated 9 months ago
Related projects ⓘ
Alternatives and complementary repositories for llm_uncertainty
- ☆21Updated last month
- Bayesian low-rank adaptation for large language models☆23Updated 6 months ago
- [NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors☆69Updated 8 months ago
- ☆38Updated last year
- ☆25Updated 4 months ago
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆14Updated 3 weeks ago
- ☆12Updated 5 months ago
- Lightweight Adapting for Black-Box Large Language Models☆18Updated 9 months ago
- Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep☆28Updated 4 months ago
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue☆33Updated this week
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆53Updated last month
- ☆31Updated last year
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆19Updated 5 months ago
- ☆26Updated 6 months ago
- ☆77Updated 4 months ago
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆16Updated 2 months ago
- Discover and Cure: Concept-aware Mitigation of Spurious Correlation (ICML 2023)☆40Updated 7 months ago
- [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"☆47Updated last month
- Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆10Updated 5 months ago
- ☆24Updated last year
- Restore safety in fine-tuned language models through task arithmetic☆26Updated 7 months ago
- ☆33Updated 9 months ago
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.☆21Updated 6 months ago
- Official code for the paper: Evaluating Copyright Takedown Methods for Language Models☆15Updated 4 months ago
- Code for "Surgical Fine-Tuning Improves Adaptation to Distribution Shifts" published at ICLR 2023☆28Updated last year
- ☆26Updated 3 weeks ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆54Updated 2 weeks ago
- Official code for ICML 2024 paper on Persona In-Context Learning (PICLe)☆21Updated 4 months ago
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆63Updated 8 months ago
- ☆9Updated last year