pratyushmaini / llm_dataset_inference
Official Repository for Dataset Inference for LLMs
☆40 Updated last year
Alternatives and similar repositories for llm_dataset_inference
Users who are interested in llm_dataset_inference are comparing it to the repositories listed below
- ☆44 Updated 7 months ago
- ☆57 Updated 2 years ago
- ☆13 Updated 2 years ago
- Source code of NAACL 2025 Findings "Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models" ☆13 Updated 7 months ago
- Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models" ☆111 Updated 6 months ago
- ☆39 Updated 2 years ago
- [ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training data of LLMs ☆44 Updated 3 months ago
- 🤫 Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Con… ☆44 Updated last year
- The official repository of the paper "On the Exploitability of Instruction Tuning". ☆64 Updated last year
- ☆56 Updated last year
- ☆37 Updated 8 months ago
- Code for watermarking language models ☆82 Updated last year
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning