eric-mitchell / detect-gpt
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature
☆357Updated last year
Related projects ⓘ
Alternatives and complementary repositories for detect-gpt
- ☆528Updated 8 months ago
- Pytorch implementation of DetectGPT (https://arxiv.org/pdf/2301.11305v1.pdf)☆178Updated 5 months ago
- Continuously updated list of related resources for generative LLMs like GPT and their analysis and detection.☆197Updated 2 months ago
- The lastest paper about detection of LLM-generated text and code☆218Updated this week
- Code base for "Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature".☆198Updated 4 months ago
- Official repository for our NeurIPS 2023 paper "Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense…☆138Updated last year
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…☆174Updated this week
- UP-TO-DATE LLM Watermark paper. 🔥🔥🔥☆293Updated this week
- Can AI-Generated Text be Reliably Detected?☆62Updated last year
- SeqXGPT: An advance method for sentence-level AI-generated text detection.☆76Updated last year
- Accompanying repo for the RLPrompt paper☆301Updated 5 months ago
- We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20…☆241Updated 8 months ago
- ☆111Updated last year
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…☆61Updated this week
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…☆208Updated last year
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆126Updated last year
- A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…☆292Updated 6 months ago
- LLM Unlearning☆125Updated last year
- LLMDet is a text detection tool that can identify which generated sources the text came from (e.g. large language model or human-write).☆50Updated 5 months ago
- Code for watermarking language models☆72Updated 2 months ago
- ☆275Updated 3 months ago
- A Survey of Attributions for Large Language Models☆168Updated 2 months ago
- 【ACL 2024】 SALAD benchmark & MD-Judge☆106Updated last month
- The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models".☆245Updated 3 weeks ago
- Source Code of Paper "GPTScore: Evaluate as You Desire"☆231Updated last year
- Official code repository for Mixset.☆21Updated 3 weeks ago
- A resource repository for machine unlearning in large language models☆218Updated last week
- Repository for research in the field of Responsible NLP at Meta.☆186Updated last week
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model☆469Updated last month
- Code for the ACL-2022 paper "Knowledge Neurons in Pretrained Transformers"☆157Updated 6 months ago