ColinLu50 / Evade-GPT-Detector
Source code for paper **Large Language Models can be Guided to Evade AI-Generated Text Detection**
☆33Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Evade-GPT-Detector
- LLMDet is a text detection tool that can identify which generated sources the text came from (e.g. large language model or human-write).☆50Updated 5 months ago
- SeqXGPT: An advance method for sentence-level AI-generated text detection.☆75Updated last year
- Official code repository for Mixset.☆21Updated 2 weeks ago
- Official repository for our NeurIPS 2023 paper "Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense…☆137Updated last year
- [TACL] Code for "Red Teaming Language Model Detectors with Language Models"☆16Updated 11 months ago
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024)☆60Updated last month
- [ICLR'24 Spotlight] The official codes of our work on AIGC detection: "Multiscale Positive-Unlabeled Detection of AI-Generated Texts"☆105Updated 10 months ago
- ☆88Updated 2 months ago
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…☆23Updated last year
- COLING'24 Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial Attack☆28Updated 7 months ago
- Code for our NeurIPS2023 accepted paper: RADAR: Robust AI-Text Detection via Adversarial Learning. We tested RADAR on 8 LLMs including Vi…☆39Updated 7 months ago
- Code for the paper: ConDA: Contrastive Domain Adaptation for AI-generated Text Detection☆32Updated 10 months ago
- ☆32Updated 5 months ago
- Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models☆76Updated 3 months ago
- Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models"☆46Updated last year
- Continuously updated list of related resources for generative LLMs like GPT and their analysis and detection.☆196Updated 2 months ago
- Multilingual safety benchmark for Large Language Models☆22Updated 2 months ago
- Benchmarking LLMs' Emotional Alignment with Humans☆63Updated last month
- 【ACL 2024】 SALAD benchmark & MD-Judge☆103Updated last month
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆83Updated last month
- LLM Unlearning☆123Updated last year
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current…☆61Updated 9 months ago
- [NAACL 2024] End-to-End Beam Retrieval for Multi-Hop Question Answering☆76Updated 7 months ago
- Scaling Sentence Embeddings with Large Language Models☆98Updated 7 months ago
- Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888☆36Updated 5 months ago
- Official repo for SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency☆33Updated 5 months ago
- ☆38Updated last year
- Codes for the EMNLP 2023 Findings paper "Self-Polish: Enhance Reasoning in Large Language Models via Problem Refining" by Zhiheng Xi, Sen…☆27Updated last year
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…☆207Updated last year
- Code base for "Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature".☆192Updated 3 months ago