JJchy / CG_score
Data Valuation without Training of a Model, submitted to ICLR'23
☆22, updated last year
Related projects
Alternatives and complementary repositories for CG_score
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024. ☆49, updated last week
- [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models". ☆46, updated last month
- Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning". ☆13, updated 9 months ago
- Official repository for the paper "Safety Alignment Should Be Made More Than Just a Few Tokens Deep". ☆26, updated 4 months ago
- Code and data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization". ☆10, updated 4 months ago
- Repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024. ☆20, updated 5 months ago
- Official code for the paper "A Closer Look at Machine Unlearning for Large Language Models". ☆12, updated last month
- Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024). ☆23, updated 9 months ago
- Source code for the NeurIPS'24 paper "HaloScope: Harnessing Unlabeled LLM Generations for Hallucination Detection". ☆18, updated last month
- [ATTRIB @ NeurIPS 2024] When Attention Sink Emerges in Language Models: An Empirical View. ☆27, updated 3 weeks ago
- [ICLR 2024] Provable Robust Watermarking for AI-Generated Text. ☆26, updated 11 months ago
- Code for the paper "Universal Jailbreak Backdoors from Poisoned Human Feedback". ☆41, updated 6 months ago
- Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]. ☆34, updated 2 weeks ago
- Landing page for TOFU. ☆94, updated 5 months ago
- [arXiv 2024] Adversarial attacks on multimodal agents. ☆37, updated 4 months ago
- Code for the safety test in "Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates". ☆17, updated 8 months ago
- Official code for the publication "The Close Relationship Between Contrastive Learning and Meta-Learning". ☆20, updated 2 years ago