AmritaBh / ConDA-gen-text-detection
Code for the paper: ConDA: Contrastive Domain Adaptation for AI-generated Text Detection
☆37Updated last year
Alternatives and similar repositories for ConDA-gen-text-detection:
Users that are interested in ConDA-gen-text-detection are comparing it to the libraries listed below
- The dataset and code for the ICLR 2024 paper "Can LLM-Generated Misinformation Be Detected?"☆63Updated 5 months ago
- [AAAI 2024] The official repository for our paper, "OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially …☆42Updated last month
- (NAACL 2024) Official code repository for Mixset.☆24Updated 4 months ago
- DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text☆29Updated last year
- Dataset and Code for Multimodal Fact Checking and Explanation Generation (Mocheg)☆51Updated last year
- SeqXGPT: An advance method for sentence-level AI-generated text detection.☆87Updated last year
- Code for our CVPR'22 paper: Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources☆35Updated 2 years ago
- M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection☆24Updated last year
- Paper list for the survey "Combating Misinformation in the Age of LLMs: Opportunities and Challenges" and the initiative "LLMs Meet Misin…☆99Updated 5 months ago
- ☆25Updated 7 months ago
- Code and data for paper "A Semantic Invariant Robust Watermark for Large Language Models" accepted by ICLR 2024.☆28Updated 5 months ago
- ☆42Updated 2 months ago
- Code for the paper "BadPrompt: Backdoor Attacks on Continuous Prompts"☆36Updated 9 months ago
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆15Updated 3 months ago
- ☆24Updated last year
- [TACL] Code for "Red Teaming Language Model Detectors with Language Models"☆19Updated last year
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022☆29Updated 2 years ago
- ☆19Updated last month
- RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024☆72Updated 6 months ago
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024)☆74Updated 2 weeks ago
- ☆25Updated last year
- LLMDet is a text detection tool that can identify which generated sources the text came from (e.g. large language model or human-write).☆69Updated 10 months ago
- Code for the paper "Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models" (NAACL-…☆40Updated 3 years ago
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…☆23Updated last year
- Code for the ACL 2023 Paper "Fact-Checking Complex Claims with Program-Guided Reasoning"☆54Updated last year
- The lastest paper about detection of LLM-generated text and code☆258Updated 3 months ago
- Official repository for our NeurIPS 2023 paper "Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense…☆166Updated last year
- Code for Findings-EMNLP 2023 paper: Multi-step Jailbreaking Privacy Attacks on ChatGPT☆33Updated last year
- ☆26Updated last year
- [EMNLP 2024] "Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective"☆24Updated 9 months ago