pratyushmaini/llm_dataset_inference

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pratyushmaini/llm_dataset_inference)

pratyushmaini / llm_dataset_inference

Official Repository for Dataset Inference for LLMs

☆41

Alternatives and similar repositories for llm_dataset_inference

Users that are interested in llm_dataset_inference are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

parameterlab / mia-scaling
View on GitHub
Source code of NAACL 2025 Findings "Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models"
☆16Dec 16, 2025Updated 7 months ago
locuslab / T-MARS
View on GitHub
Code for T-MARS data filtering
☆35Aug 23, 2023Updated 2 years ago
JTWang2000 / FreeShap
View on GitHub
Fine-tuning-free Shapley value (FreeShap) for instance attribution
☆14May 29, 2024Updated 2 years ago
locuslab / scaling_laws_data_filtering
View on GitHub
☆64Apr 9, 2024Updated 2 years ago
iamgroot42 / mimir
View on GitHub
Python package for measuring memorization in LLMs.
☆195Jul 16, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
ali7naseh / RAG_MIA
View on GitHub
☆15Jun 28, 2025Updated last year
cleverhans-lab / dataset-inference
View on GitHub
[ICLR'21] Dataset Inference for Ownership Resolution in Machine Learning
☆31Oct 10, 2022Updated 3 years ago
eth-lre / LLM_ICL
View on GitHub
ACL24
☆11Jun 7, 2024Updated 2 years ago
yidingjiang / ado
View on GitHub
The repository contains code for Adaptive Data Optimization
☆37Dec 9, 2024Updated last year
locuslab / acr-memorization
View on GitHub
☆41Dec 19, 2024Updated last year
alisawuffles / tokenizer-attack
View on GitHub
Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?"
☆23May 15, 2025Updated last year
py85252876 / Reconstruction-based-Attack
View on GitHub
☆19Jul 18, 2024Updated 2 years ago
dell-research-harvard / NEWS-COPY
View on GitHub
Noise-robust de-duplication at scale
☆19Apr 9, 2023Updated 3 years ago
r-three / realistic_evaluation_of_model_merging_for_compositional_generalization
View on GitHub
☆13Feb 11, 2026Updated 5 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mengtong0110 / Tokenizer-MIA
View on GitHub
[USENIX Security 2026] Membership Inference Attacks on Tokenizers of Large Language Models
☆21May 22, 2026Updated 2 months ago
pietrolesci / memorisation-profiles
View on GitHub
This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".
☆25Mar 25, 2025Updated last year
zhiAung / Finger-vein-recognition
View on GitHub
Finger vein recognition in biometrics
☆10Jun 30, 2019Updated 7 years ago
locuslab / robust_union
View on GitHub
[ICML'20] Multi Steepest Descent (MSD) for robustness against the union of multiple perturbation models.
☆25Jul 25, 2024Updated 2 years ago
y0mingzhang / diffuse-distributions
View on GitHub
Forcing Diffuse Distributions out of Language Models
☆18Sep 10, 2024Updated last year
eth-sri / SynthPAI
View on GitHub
A Synthetic Dataset for Personal Attribute Inference (NeurIPS'24 D&B)
☆59Jul 27, 2025Updated last year
mireshghallah / ft-memorization
View on GitHub
☆13Oct 20, 2022Updated 3 years ago
vfleaking / PTST
View on GitHub
Code for safety test in "Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates"
☆22Sep 21, 2025Updated 10 months ago
kowndinya-renduchintala / POSIX
View on GitHub
POSIX: A Prompt Sensitivity Index for Language Models
☆13Nov 13, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
tml-epfl / long-is-more-for-alignment
View on GitHub
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]
☆21May 2, 2024Updated 2 years ago
fKunstner / dataset-downloader
View on GitHub
☆13Aug 15, 2024Updated last year
o-laurent / multivariate-ks-test
View on GitHub
Python implementation of an extension of the Kolmogorov-Smirnov test for multivariate samples
☆13Aug 6, 2023Updated 2 years ago
xszheng2020 / memorization
View on GitHub
An Empirical Study of Memorization in NLP (ACL 2022)
☆13Jun 22, 2022Updated 4 years ago
jonasrauber / linear-region-attack
View on GitHub
A powerful white-box adversarial attack that exploits knowledge about the geometry of neural networks to find minimal adversarial perturb…
☆12Aug 5, 2020Updated 5 years ago
AbhilashaRavichander / information-probing
View on GitHub
☆11May 18, 2025Updated last year
YukeHu / vlm_mia
View on GitHub
Code for paper "Membership Inference Attacks Against Vision-Language Models"
☆31Jan 25, 2025Updated last year
eth-sri / privacy-inference-multimodal
View on GitHub
☆21Feb 3, 2025Updated last year
TIGER-AI-Lab / TableCoT
View on GitHub
The code and data for paper "Large Language Models are few(1)-shot Table Reasoners" [EACL2023]
☆47Apr 30, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
nikhilchandak / answer-matching
View on GitHub
Code for 'Answer Matching Outperforms Multiple Choice for Language Model Evaluation' paper
☆18Jul 4, 2025Updated last year
eth-sri / llm-anonymization
View on GitHub
☆23May 23, 2025Updated last year
ethz-spylab / superhuman-ai-consistency
View on GitHub
☆30Jun 19, 2023Updated 3 years ago
facebookresearch / synlm
View on GitHub
Code for paper: "Privately generating tabular data using language models".
☆16Jun 13, 2023Updated 3 years ago
amro-kamal / ObjectPose
View on GitHub
☆13Jul 19, 2022Updated 4 years ago
VITA-Group / Robust_Weight_Signatures
View on GitHub
[ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang
☆16May 4, 2023Updated 3 years ago
cindyxinyiwang / TrDec_pytorch
View on GitHub
☆36Oct 3, 2018Updated 7 years ago