IBM / RADAR
Code for our NeurIPS 2023 paper, RADAR: Robust AI-Text Detection via Adversarial Learning. We tested RADAR on 8 LLMs, including Vicuna and LLaMA. The results show that RADAR attains good detection performance on LLM-generated text while remaining robust against paraphrasing.
☆45 · Updated 11 months ago
Alternatives and similar repositories for RADAR:
Users who are interested in RADAR are comparing it to the repositories listed below:
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current… ☆67 · Updated 3 months ago
- SeqXGPT: An advanced method for sentence-level AI-generated text detection. ☆80 · Updated last year
- Source code for paper **Large Language Models can be Guided to Evade AI-Generated Text Detection** ☆35 · Updated last year
- A complete overview and insights into AI-Text detection using the powerful BERT (Bidirectional Encoder Representations from Transformers) to pr… ☆49 · Updated last year
- Ghostbuster: Detecting Text Ghostwritten by Large Language Models (NAACL 2024) ☆139 · Updated 8 months ago
- (NAACL 2024) Official code repository for Mixset. ☆21 · Updated 2 months ago
- Can AI-Generated Text be Reliably Detected? ☆72 · Updated last year
- Official repository for our NeurIPS 2023 paper "Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense… ☆157 · Updated last year
- [AAAI 2024] The official repository for our paper, "OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially … ☆38 · Updated 2 weeks ago
- RAID is the largest and most challenging benchmark for AI-generated text detection. (ACL 2024) ☆53 · Updated this week
- M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection ☆21 · Updated 10 months ago
- This project aims to build upon the existing MGTBench project, extending its functionalities with the option to import and evaluate the bench… ☆13 · Updated 3 months ago
- Code/data for MARG (multi-agent review generation) ☆38 · Updated 3 months ago
- DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text ☆25 · Updated last year
- COLING'24 Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial Attack ☆38 · Updated 10 months ago
- A survey and reflection on the latest research breakthroughs in LLM-generated Text detection, including data, detectors, metrics, current… ☆199 · Updated last month
- Official code for DNA-GPT (ICLR 2024) ☆50 · Updated 10 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F… ☆62 · Updated 8 months ago
- Transformer-based model for learning authorship representations. ☆32 · Updated 6 months ago
- The code implementation of the paper Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks (A… ☆11 · Updated 7 months ago
- This tool will parse the inputted text into many different existing AI writing detection tools and return the results to you. This is to … ☆54 · Updated last year
- Official repo for SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency ☆35 · Updated last month
- LLMDet is a text detection tool that can identify which source a text came from (e.g., a large language model or a human writer). ☆61 · Updated 8 months ago
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind ☆67 · Updated last year
- Code base for "Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature". ☆230 · Updated last month
- The official implementation of our NAACL 2024 paper "A Wolf in Sheep’s Clothing: Generalized Nested Jailbreak Prompts can Fool Large Lang… ☆92 · Updated last month
- AI-Generated Text Detection: A BERT-powered solution for accurately identifying AI-generated text. Seamlessly integrated, highly accurate… ☆50 · Updated 7 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ☆83 · Updated 6 months ago