Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confidence Estimation"
☆23May 8, 2023Updated 2 years ago
Alternatives and similar repositories for Targeted-Data-Extraction
Users that are interested in Targeted-Data-Extraction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆40May 19, 2023Updated 2 years ago
- ☆304Apr 8, 2026Updated 3 weeks ago
- The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word pred…☆104Aug 13, 2024Updated last year
- ☆13Oct 20, 2022Updated 3 years ago
- Code for our NeurIPS 2023 paper Towards Evaluating Transfer-based Attacks Systematically, Practically, and Fairly☆14Jan 22, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official Code for NAACL 2022 paper: "Persona-Guided Planning for Controlling the Protagonist's Persona in Story Generation"☆16Sep 1, 2022Updated 3 years ago
- Python package for measuring memorization in LLMs.☆189Jul 16, 2025Updated 9 months ago
- ☆14May 8, 2024Updated last year
- Repo for EmbedLLM: Learning Compact Representations of Large Language Models☆29Sep 25, 2025Updated 7 months ago
- [NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey☆111Aug 7, 2024Updated last year
- ☆23Dec 28, 2023Updated 2 years ago
- About Official PyTorch implementation of "Query-Efficient Black-Box Red Teaming via Bayesian Optimization" (ACL'23)☆15Jul 9, 2023Updated 2 years ago
- ☆12Jul 18, 2023Updated 2 years ago
- ☆15Feb 21, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 🤫 Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Con…☆54Dec 20, 2023Updated 2 years ago
- for DTCA model☆10Oct 17, 2023Updated 2 years ago
- ☆29Aug 31, 2025Updated 8 months ago
- Official repository for “Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space”☆18Jan 27, 2026Updated 3 months ago
- Official implementation for "Instruction Tuning with Retrieval-based Examples Ranking for Aspect-based Sentiment Analysis"☆13Mar 23, 2026Updated last month
- Official implementation for KDD25 paper "GraphLoRA: Structure-Aware Contrastive Low-Rank Adaptation for Cross-Graph Transfer Learning"☆22Jul 10, 2025Updated 9 months ago
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models☆16Jun 18, 2025Updated 10 months ago
- [ACL2024] Progressively Modality Freezing for Multi-Modal Entity Alignment☆18Apr 10, 2025Updated last year
- ☆42May 23, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The code for "MoPE: Mixture of Prefix Experts for Zero-Shot Dialogue State Tracking"☆19Jan 25, 2025Updated last year