eric-mitchell / detect-gptLinks

DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature

☆416

Alternatives and similar repositories for detect-gpt

Users that are interested in detect-gpt are comparing it to the libraries listed below

Sorting:

Xianjun-Yang / Awesome_papers_on_LLMs_detection
The lastest paper about detection of LLM-generated text and code
☆274Updated last month
NLP2CT / LLM-generated-Text-Detection
A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…
☆225Updated 7 months ago
martiansideofthemoon / ai-detection-paraphrases
Official repository for our NeurIPS 2023 paper "Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense…
☆173Updated last year
ICTMCG / Awesome-Machine-Generated-Text
Continuously updated list of related resources for generative LLMs like GPT and their analysis and detection.
☆223Updated 2 months ago
jwkirchenbauer / lm-watermarking
☆609Updated last month
junchaoIU / LLM-generated-Text-Detection
A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…
☆76Updated 8 months ago
vinusankars / Reliability-of-AI-text-detectors
Can AI-Generated Text be Reliably Detected?
☆81Updated last year
hzy312 / Awesome-LLM-Watermark
UP-TO-DATE LLM Watermark paper. 🔥🔥🔥
☆351Updated 7 months ago
Jihuai-wpy / SeqXGPT
SeqXGPT: An advance method for sentence-level AI-generated text detection.
☆92Updated last year
baoguangsheng / fast-detect-gpt
Code base for ICLR 2024 "Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature".
☆322Updated 4 months ago
Dongping-Chen / MixSet
(NAACL 2024) Official code repository for Mixset.
☆26Updated 8 months ago
BurhanUlTayyab / DetectGPT
Pytorch implementation of DetectGPT (https://arxiv.org/pdf/2301.11305v1.pdf)
☆211Updated last year
TrustedLLM / LLMDet
LLMDet is a text detection tool that can identify which generated sources the text came from (e.g. large language model or human-write).
☆77Updated last year
microsoft / TOXIGEN
This repo contains the code for generating the ToxiGen dataset, published at ACL 2022.
☆325Updated last year
LLM-Tuning-Safety / LLMs-Finetuning-Safety
We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20…
☆314Updated last year
allenai / real-toxicity-prompts
☆216Updated 4 years ago
liamdugan / raid
RAID is the largest and most challenging benchmark for AI-generated text detection. (ACL 2024)
☆79Updated last week
mingkaid / rl-prompt
Accompanying repo for the RLPrompt paper
☆342Updated last year
RUCAIBox / HaluEval
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
☆497Updated last year
ryuryukke / OUTFOX
[AAAI 2024] The official repository for our paper, "OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially …
☆44Updated 4 months ago
google-research / lm-extraction-benchmark
☆293Updated this week
swj0419 / detect-pretrain-code
This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…
☆228Updated last year
thunlp / Advbench
Code and data of the EMNLP 2022 paper "Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversaria…
☆53Updated 2 years ago
shmsw25 / FActScore
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…
☆366Updated 3 months ago
niconi19 / LLM-Conversation-Safety
[NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey
☆106Updated last year
jthickstun / watermark
Code for watermarking language models
☆80Updated 11 months ago
OpenSafetyLab / SALAD-BENCH
【ACL 2024】 SALAD benchmark & MD-Judge
☆156Updated 5 months ago
i-gallegos / Fair-LLM-Benchmark
☆141Updated last year
HowieHwong / TrustLLM
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
☆586Updated last month
AI-secure / DecodingTrust
A Comprehensive Assessment of Trustworthiness in GPT Models
☆299Updated 10 months ago