AmritaBh / ConDA-gen-text-detectionLinks
Code for the paper: ConDA: Contrastive Domain Adaptation for AI-generated Text Detection
☆40Updated last year
Alternatives and similar repositories for ConDA-gen-text-detection
Users that are interested in ConDA-gen-text-detection are comparing it to the libraries listed below
Sorting:
- The dataset and code for the ICLR 2024 paper "Can LLM-Generated Misinformation Be Detected?"☆76Updated 10 months ago
- M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection☆31Updated last year
- DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text☆30Updated 2 years ago
- [AAAI 2024] The official repository for our paper, "OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially …☆43Updated 5 months ago
- ☆28Updated 11 months ago
- Paper list for the survey "Combating Misinformation in the Age of LLMs: Opportunities and Challenges" and the initiative "LLMs Meet Misin…☆103Updated 10 months ago
- ☆26Updated last year
- Dataset and Code for Multimodal Fact Checking and Explanation Generation (Mocheg)☆58Updated last year
- (NAACL 2024) Official code repository for Mixset.☆26Updated 9 months ago
- Code for the paper "Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models" (NAACL-…☆42Updated 4 years ago
- Official implementation of the EMNLP 2021 paper "ONION: A Simple and Effective Defense Against Textual Backdoor Attacks"☆34Updated 3 years ago
- Code and data for paper "A Semantic Invariant Robust Watermark for Large Language Models" accepted by ICLR 2024.☆34Updated 10 months ago
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…☆23Updated 2 years ago
- ☆44Updated 7 months ago
- Codes and datasets of the paper Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment☆105Updated last year
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022☆30Updated 3 years ago
- The lastest paper about detection of LLM-generated text and code☆277Updated 2 months ago
- RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024☆81Updated 11 months ago
- ☆39Updated 2 years ago
- A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)☆26Updated 4 years ago
- ☆14Updated last year
- This repository contains data and code used for On the Risk of Misinformation Pollution with Large Language Models (EMNLP 2023 Findings).☆14Updated last year
- [EMNLP 2025 Main] ConceptVectors Benchmark and Code for the paper "Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces"☆35Updated 3 weeks ago
- ☆22Updated 2 years ago
- Official repository for our NeurIPS 2023 paper "Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense…☆174Updated last year
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models☆82Updated last year
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆68Updated 2 years ago
- Data for our paper "Defending ChatGPT against Jailbreak Attack via Self-Reminder"☆20Updated last year
- Official code for ICML 2024 paper on Persona In-Context Learning (PICLe)☆26Updated last year
- ☆38Updated last year