vinusankars / Reliability-of-AI-text-detectorsLinks

Can AI-Generated Text be Reliably Detected?

☆81

Alternatives and similar repositories for Reliability-of-AI-text-detectors

Users that are interested in Reliability-of-AI-text-detectors are comparing it to the libraries listed below

Sorting:

martiansideofthemoon / ai-detection-paraphrases
Official repository for our NeurIPS 2023 paper "Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense…
☆173Updated last year
ICTMCG / Awesome-Machine-Generated-Text
Continuously updated list of related resources for generative LLMs like GPT and their analysis and detection.
☆223Updated 2 months ago
junchaoIU / LLM-generated-Text-Detection
A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…
☆76Updated 8 months ago
Xianjun-Yang / Awesome_papers_on_LLMs_detection
The lastest paper about detection of LLM-generated text and code
☆274Updated last month
amazon-science / controlling-llm-memorization
☆36Updated 2 years ago
liamdugan / raid
RAID is the largest and most challenging benchmark for AI-generated text detection. (ACL 2024)
☆79Updated last week
NLP2CT / LLM-generated-Text-Detection
A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…
☆225Updated 7 months ago
XuandongZhao / weak-to-strong
[ICML 2025] Weak-to-Strong Jailbreaking on Large Language Models
☆84Updated 3 months ago
eric-mitchell / detect-gpt
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature
☆416Updated 2 years ago
thunlp / Advbench
Code and data of the EMNLP 2022 paper "Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversaria…
☆53Updated 2 years ago
OSU-NLP-Group / AmpleGCG
AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLM
☆69Updated 9 months ago
Jihuai-wpy / SeqXGPT
SeqXGPT: An advance method for sentence-level AI-generated text detection.
☆92Updated last year
pratyushmaini / llm_dataset_inference
Official Repository for Dataset Inference for LLMs
☆36Updated last year
paul-rottger / xstest
Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"
☆106Updated 5 months ago
mbzuai-nlp / M4
M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection
☆30Updated last year
datamllab / awsome-LLM-generated-text-detection
☆28Updated 2 years ago
declare-lab / red-instruct
Codes and datasets of the paper Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment
☆103Updated last year
iamgroot42 / mimir
Python package for measuring memorization in LLMs.
☆161Updated 3 weeks ago
allenai / wildguard
Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs
☆87Updated 8 months ago
jthickstun / watermark
Code for watermarking language models
☆80Updated 11 months ago
Dongping-Chen / MixSet
(NAACL 2024) Official code repository for Mixset.
☆26Updated 8 months ago
collinzrj / output2prompt
☆44Updated 4 months ago
kevinyaobytedance / llm_unlearn
LLM Unlearning
☆172Updated last year
Princeton-SysML / Jailbreak_LLM
☆178Updated last year
llm-misinformation / llm-misinformation
The dataset and code for the ICLR 2024 paper "Can LLM-Generated Misinformation Be Detected?"
☆71Updated 9 months ago
SALT-NLP / chain-of-thought-bias
☆28Updated 10 months ago
mbzuai-nlp / DetectLLM
DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text
☆30Updated 2 years ago
vivek3141 / ghostbuster
Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024)
☆160Updated last year
AlexWan0 / Poisoning-Instruction-Tuned-Models
☆57Updated last year
facebookresearch / three_bricks
Official Implementation of the paper "Three Bricks to Consolidate Watermarks for LLMs"
☆48Updated last year