zhang-wei-chao / DC-PDDLinks
This repository presents the original implementation of Pretraining Data Detection for Large Language Models: A Divergence-based Calibration Method by Weichao Zhang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng
☆17Updated 3 months ago
Alternatives and similar repositories for DC-PDD
Users that are interested in DC-PDD are comparing it to the libraries listed below
Sorting:
- Implementation code for ACL2024:Advancing Parameter Efficiency in Fine-tuning via Representation Editing☆14Updated last year
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆128Updated 10 months ago
- Model merging is a highly efficient approach for long-to-short reasoning.☆80Updated 3 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆116Updated last year
- [ICLR 2025] Released code for paper "Spurious Forgetting in Continual Learning of Language Models"☆50Updated 3 months ago
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆15Updated 7 months ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆61Updated last year
- RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024☆82Updated 11 months ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆69Updated last year
- LLM Unlearning☆174Updated last year
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning☆97Updated last year
- [ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"☆78Updated 8 months ago
- ☆47Updated last year
- ☆30Updated 6 months ago
- [ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training data of LLMs☆43Updated 3 months ago
- Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …☆35Updated 7 months ago
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models☆41Updated 9 months ago
- ☆50Updated last month
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆103Updated 3 weeks ago
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆68Updated 2 years ago
- A method of ensemble learning for heterogeneous large language models.☆60Updated last year
- Official code for SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆40Updated 4 months ago
- ICLR2024 Paper. Showing properties of safety tuning and exaggerated safety.☆87Updated last year
- LoFiT: Localized Fine-tuning on LLM Representations☆40Updated 7 months ago
- ☆41Updated 11 months ago
- ☆66Updated 4 months ago
- The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning”☆17Updated last year
- 【ACL 2024】 SALAD benchmark & MD-Judge☆158Updated 5 months ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens☆15Updated last year
- Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning☆34Updated 9 months ago