liamdugan / raid
RAID is the largest and most challenging benchmark for machine-generated text detectors. (ACL 2024)
☆48 · Updated this week
Alternatives and similar repositories for raid:
Users interested in raid are comparing it to the repositories listed below.
- ☆15 · Updated 3 months ago
- ☆118 · Updated last year
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers". ☆64 · Updated 10 months ago
- Improving Alignment and Robustness with Circuit Breakers ☆174 · Updated 3 months ago
- Weak-to-Strong Jailbreaking on Large Language Models ☆73 · Updated 10 months ago
- Official code for the paper "Evaluating Copyright Takedown Methods for Language Models" ☆16 · Updated 6 months ago
- Official implementation of AdvPrompter (https://arxiv.org/abs/2404.16873) ☆134 · Updated 8 months ago
- Röttger et al. (2023): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models" ☆77 · Updated last year
- Official repository for our NeurIPS 2023 paper "Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense…" ☆150 · Updated last year
- Repository for the Bias Benchmark for QA dataset. ☆94 · Updated last year
- ☆158 · Updated last year
- Official repository for ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models" ☆83 · Updated 4 months ago
- ICLR 2024 paper showing properties of safety tuning and exaggerated safety. ☆75 · Updated 8 months ago
- ☆57 · Updated 3 weeks ago
- Code and datasets for the paper "Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment" ☆88 · Updated 10 months ago
- Can AI-Generated Text be Reliably Detected? ☆65 · Updated last year
- Official repository for ACL 2024 paper "SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding" ☆112 · Updated 5 months ago
- ☆36 · Updated last year
- Python package for measuring memorization in LLMs. ☆134 · Updated last month
- Repo for the paper "Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge" ☆12 · Updated 10 months ago
- LLM experiments done during SERI MATS, focusing on activation steering / interpreting activation spaces ☆85 · Updated last year
- ☆44 · Updated 6 months ago
- AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLM ☆51 · Updated 2 months ago
- The latest papers about detection of LLM-generated text and code ☆244 · Updated last week
- ☆44 · Updated last year
- ☆24 · Updated 3 months ago
- "Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning" by Chongyu Fan*, Jiancheng Liu*, Licong Lin*, Jingh… ☆21 · Updated this week
- For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research. ☆82 · Updated this week
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following ☆120 · Updated 6 months ago
- SeqXGPT: An advanced method for sentence-level AI-generated text detection. ☆79 · Updated last year