yyy01 / PACLinks
The official implementation of the paper "Data Contamination Calibration for Black-box LLMs" (ACL 2024)
☆14Updated last year
Alternatives and similar repositories for PAC
Users that are interested in PAC are comparing it to the libraries listed below
Sorting:
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆97Updated last year
- [ACL'25 Main] SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence! | 让你的LLM更好地利用上下文文档:一个基于注意力的简单方案☆24Updated 9 months ago
- Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep☆166Updated 7 months ago
- ☆30Updated 8 months ago
- Toolkit for evaluating the trustworthiness of generative foundation models.☆123Updated 3 months ago
- Code and data repository for "The Mirage of Model Editing: Revisiting Evaluation in the Wild"☆16Updated 3 months ago
- Official codebase for "STAIR: Improving Safety Alignment with Introspective Reasoning"☆87Updated 9 months ago
- Awesome Large Reasoning Model(LRM) Safety.This repository is used to collect security-related research on large reasoning models such as …☆78Updated this week
- ☆40Updated 2 years ago
- ☆21Updated 8 months ago
- A curated list of resources for activation engineering☆114Updated 2 months ago
- "In-Context Unlearning: Language Models as Few Shot Unlearners". Martin Pawelczyk, Seth Neel* and Himabindu Lakkaraju*; ICML 2024.☆28Updated 2 years ago
- ☆53Updated last year
- Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic☆31Updated 2 months ago
- This is the official code for the paper "Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturba…☆33Updated 8 months ago
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆88Updated 11 months ago
- [NeurIPS 2024] "Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?"☆37Updated 4 months ago
- [ICML 2024] Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models.☆80Updated 10 months ago
- [ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"☆11Updated 9 months ago
- Principled Data Selection for Alignment: The Hidden Risks of Difficult Examples☆44Updated 4 months ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆36Updated 10 months ago
- [ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training data of LLMs☆50Updated 6 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆72Updated 9 months ago
- The reinforcement learning codes for dataset SPA-VL☆42Updated last year
- [ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi…☆68Updated last year
- Awesome-Low-Rank-Adaptation☆123Updated last year
- ECSO (Make MLLM safe without neither training nor any external models!) (https://arxiv.org/abs/2403.09572)☆34Updated last year
- [ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond"☆13Updated 9 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]☆51Updated last year
- [EMNLP 2023, Main Conference] Sparse Low-rank Adaptation of Pre-trained Language Models☆85Updated last year