eric-mitchell / detect-gpt
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature
☆434 Updated 2 years ago
Alternatives and similar repositories for detect-gpt
Users who are interested in detect-gpt are comparing it to the repositories listed below.
- ☆629 Updated last month
- Official repository for our NeurIPS 2023 paper "Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense… ☆175 Updated last year
- The latest papers about detection of LLM-generated text and code ☆278 Updated 3 months ago
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current… ☆232 Updated 9 months ago
- Continuously updated list of related resources for generative LLMs like GPT and their analysis and detection. ☆226 Updated 4 months ago
- Code base for ICLR 2024 "Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature". ☆347 Updated last month
- This repo contains the code for generating the ToxiGen dataset, published at ACL 2022. ☆332 Updated last year
- UP-TO-DATE LLM Watermark paper. 🔥🔥🔥 ☆357 Updated 10 months ago
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current… ☆78 Updated 10 months ago
- This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models. ☆515 Updated last year
- Can AI-Generated Text be Reliably Detected? ☆85 Updated last year
- RAID is the largest and most challenging benchmark for AI-generated text detection. (ACL 2024) ☆87 Updated last week
- SeqXGPT: An advanced method for sentence-level AI-generated text detection. ☆93 Updated 2 years ago
- PyTorch implementation of DetectGPT (https://arxiv.org/pdf/2301.11305v1.pdf) ☆213 Updated last year
- ☆293 Updated 2 months ago
- We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20… ☆324 Updated last year
- [ICML 2024] TrustLLM: Trustworthiness in Large Language Models ☆600 Updated 3 months ago
- Accompanying repo for the RLPrompt paper ☆356 Updated last year
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji… ☆233 Updated last year
- ☆160 Updated 8 months ago
- LLMDet is a text detection tool that can identify which source generated a text (e.g. a large language model or a human writer). ☆77 Updated last year
- Code and data of the EMNLP 2022 paper "Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversaria… ☆59 Updated 2 years ago
- (NAACL 2024) Official code repository for Mixset. ☆27 Updated 10 months ago
- [NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey ☆106 Updated last year
- Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024) ☆162 Updated last year
- ☆152 Updated 2 years ago
- [AAAI 2024] The official repository for our paper, "OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially … ☆46 Updated 6 months ago
- [NeurIPS 2024 D&B] DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios ☆33 Updated 10 months ago
- 【ACL 2024】 SALAD benchmark & MD-Judge ☆162 Updated 7 months ago
- TruthfulQA: Measuring How Models Imitate Human Falsehoods ☆820 Updated 9 months ago