sail-sg / D-TRAK
Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)
☆28Updated last year
Alternatives and similar repositories for D-TRAK:
Users that are interested in D-TRAK are comparing it to the libraries listed below
- Official pytorch implementation of "Interpreting the Second-Order Effects of Neurons in CLIP"☆33Updated 3 months ago
- [TMLR 2025] On Memorization in Diffusion Models☆24Updated last year
- What do we learn from inverting CLIP models?☆49Updated 11 months ago
- Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models"☆21Updated last year
- ☆12Updated 11 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆49Updated 4 months ago
- Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep☆71Updated 7 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆42Updated 6 months ago
- ☆30Updated 2 months ago
- [ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"☆75Updated last year
- ☆52Updated last year
- ☆23Updated 2 months ago
- Code for Debiasing Vision-Language Models via Biased Prompts☆56Updated last year
- Implementation of PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆32Updated 3 months ago
- [ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)☆69Updated 3 months ago
- Official repo for EMNLP'24 paper "SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning"☆18Updated 4 months ago
- This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.☆26Updated 9 months ago
- ☆17Updated 7 months ago
- ☆37Updated 3 months ago
- [Arxiv 2024] Dissecting Adversarial Robustness of Multimodal LM Agents☆60Updated last month
- Code for safety test in "Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates"☆18Updated 11 months ago
- ☆33Updated 5 months ago
- Discover and Cure: Concept-aware Mitigation of Spurious Correlation (ICML 2023)☆40Updated 10 months ago
- ☆28Updated 7 months ago
- A simple and efficient baseline for data attribution☆11Updated last year
- ☆21Updated 7 months ago
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)☆66Updated 10 months ago
- [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"☆52Updated 4 months ago
- Host CIFAR-10.2 Data Set☆13Updated 3 years ago