shizhouxing / LLM-Detector-Robustness
[TACL] Code for "Red Teaming Language Model Detectors with Language Models"
☆21 · Updated last year
Alternatives and similar repositories for LLM-Detector-Robustness
Users who are interested in LLM-Detector-Robustness are comparing it to the repositories listed below.
- The dataset and code for the ICLR 2024 paper "Can LLM-Generated Misinformation Be Detected?" ☆66 · Updated 6 months ago
- Code&Data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024] ☆79 · Updated 8 months ago
- The official repository of the paper "On the Exploitability of Instruction Tuning". ☆63 · Updated last year
- Code for the paper: ConDA: Contrastive Domain Adaptation for AI-generated Text Detection ☆40 · Updated last year
- Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses (NeurIPS 2024) ☆61 · Updated 4 months ago
- A curated list of trustworthy Generative AI papers. Daily updating... ☆73 · Updated 9 months ago
- ☆45 · Updated last year