zhang-wei-chao / DC-PDDView external linksLinks
This repository presents the original implementation of Pretraining Data Detection for Large Language Models: A Divergence-based Calibration Method by Weichao Zhang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng
☆22May 21, 2025Updated 8 months ago
Alternatives and similar repositories for DC-PDD
Users that are interested in DC-PDD are comparing it to the libraries listed below
Sorting:
- [ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training data of LLMs☆52May 26, 2025Updated 8 months ago
- ☆22Dec 22, 2024Updated last year
- The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"☆35Jun 13, 2025Updated 8 months ago
- ☆147Apr 16, 2024Updated last year
- EARAM for fake news detection☆12Dec 30, 2025Updated last month
- FTRL-Proximal Online Learning Algorithm☆15May 22, 2017Updated 8 years ago
- [USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns☆13Mar 1, 2025Updated 11 months ago
- test images with not appropriate labels in MNIST dataset☆10Mar 3, 2018Updated 7 years ago
- [WWW 25] USPTO-LLM: A Large Language Model-Assisted Information-enriched Chemical Reaction Dataset☆15Dec 12, 2024Updated last year
- RWKV Wiki website (archived, please visit official wiki)☆11Mar 26, 2023Updated 2 years ago
- Official Repo of Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents☆58Oct 28, 2025Updated 3 months ago
- Shadow Attack, LiRA, Quantile Regression and RMIA implementations in PyTorch (Online version)☆14Nov 8, 2024Updated last year
- ☆12Sep 26, 2024Updated last year
- ACL24☆11Jun 7, 2024Updated last year
- An unofficial pyotrch implementation of "ML-Leaks:Model and Data Independent Membership Inference Attacks and Defenses on ML Models"☆11Dec 23, 2023Updated 2 years ago
- Code for the paper "Overconfidence is a Dangerous Thing: Mitigating Membership Inference Attacks by Enforcing Less Confident Prediction" …☆12Sep 6, 2023Updated 2 years ago
- Character Embedding + ESIM + Focal Loss for Chinese Answer Sentence Selection☆10Jan 4, 2020Updated 6 years ago
- Causal Reasoning for Membership Inference Attacks☆11Oct 21, 2022Updated 3 years ago
- Learning from Indirect Observations☆11Jul 16, 2021Updated 4 years ago
- Code for paper: Optimizing Length Compression in Large Reasoning Models☆27Oct 20, 2025Updated 3 months ago
- This project has included related source codes and datasets of our EMNLP2021 paper☆10May 28, 2022Updated 3 years ago
- 新词发现分布式机器学习算法。☆15Jul 21, 2014Updated 11 years ago
- allowing R users to work with dlib through Rcpp☆13Apr 11, 2018Updated 7 years ago
- Mathematical Analysis (et analyse fonctionnelle)☆14Feb 1, 2022Updated 4 years ago
- ☆10Dec 20, 2023Updated 2 years ago
- Encoder-decoders for translating different chemical formats.☆18Sep 17, 2025Updated 5 months ago
- Self-Teaching Notes on Gradient Leakage Attacks against GPT-2 models.☆14Mar 18, 2024Updated last year
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"☆12Mar 19, 2024Updated last year
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- Official implement of ACL'25 Findings paper "MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Lang…☆19Jun 17, 2025Updated 8 months ago
- ☆11Nov 11, 2021Updated 4 years ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆11Apr 18, 2025Updated 9 months ago
- Mini Model Daemon☆12Nov 9, 2024Updated last year
- ☆11Jun 4, 2021Updated 4 years ago
- ☆11Nov 13, 2024Updated last year
- Code of "Instruction Multi-Constraint Molecular Generation Using a Teacher-Student Large Language Model"☆14Jul 8, 2025Updated 7 months ago
- ☆10Jun 19, 2024Updated last year
- Official repository for WWW'24 paper "MemeCraft: Contextual and Stance-Driven Multimodal Meme Generation"☆12Jul 25, 2024Updated last year
- Solution of KDD cup 2021☆11Jun 16, 2021Updated 4 years ago