AmritaBh / ConDA-gen-text-detectionLinks

Code for the paper: ConDA: Contrastive Domain Adaptation for AI-generated Text Detection

☆41

Alternatives and similar repositories for ConDA-gen-text-detection

Users that are interested in ConDA-gen-text-detection are comparing it to the libraries listed below

Sorting:

mbzuai-nlp / M4
M4: Multi-generator, Multi-domain, and Multi-lingual Black-Box Machine-Generated Text Detection
☆30Updated last year
PLUM-Lab / Mocheg
Dataset and Code for Multimodal Fact Checking and Explanation Generation (Mocheg)
☆57Updated last year
llm-misinformation / llm-misinformation
The dataset and code for the ICLR 2024 paper "Can LLM-Generated Misinformation Be Detected?"
☆71Updated 9 months ago
llm-misinformation / llm-misinformation-survey
Paper list for the survey "Combating Misinformation in the Age of LLMs: Opportunities and Challenges" and the initiative "LLMs Meet Misin…
☆102Updated 9 months ago
amazon-science / controlling-llm-memorization
☆36Updated 2 years ago
ryuryukke / OUTFOX
[AAAI 2024] The official repository for our paper, "OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially …
☆44Updated 4 months ago
mbzuai-nlp / DetectLLM
DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text
☆30Updated 2 years ago
leix28 / prompt-universal-vulnerability
Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022
☆30Updated 3 years ago
SALT-NLP / chain-of-thought-bias
☆28Updated 10 months ago
snw2021 / LLM_Unlearning_Papers
☆26Updated last year
minicheshire / Robust-Prefix-Tuning
code for the ICLR'22 paper: On Robust Prefix-Tuning for Text Classification
☆27Updated 3 years ago
Vaidehi99 / InfoDeletionAttacks
☆44Updated 6 months ago
mireshghallah / ft-memorization
☆13Updated 2 years ago
mireshghallah / neighborhood-curvature-mia
☆21Updated last year
martiansideofthemoon / ai-detection-paraphrases
Official repository for our NeurIPS 2023 paper "Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense…
☆173Updated last year
xinleihe / toxic-prompt
☆25Updated last year
Dongping-Chen / MixSet
(NAACL 2024) Official code repository for Mixset.
☆26Updated 8 months ago
Xianjun-Yang / Awesome_papers_on_LLMs_detection
The lastest paper about detection of LLM-generated text and code
☆274Updated last month
jinzhuoran / RWKU
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024
☆77Updated 10 months ago
QData / TextAttack-A2T
A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)
☆26Updated 3 years ago
lancopku / Embedding-Poisoning
Code for the paper "Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models" (NAACL-…
☆41Updated 4 years ago
thu-coai / Targeted-Data-Extraction
Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…
☆23Updated 2 years ago
skywalker023 / confaide
🤫 Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Con…
☆42Updated last year
Hunter-DDM / knowledge-neurons
Code for the ACL-2022 paper "Knowledge Neurons in Pretrained Transformers"
☆170Updated last year
THU-BPM / Robust_Watermark
Code and data for paper "A Semantic Invariant Robust Watermark for Large Language Models" accepted by ICLR 2024.
☆32Updated 8 months ago
MexicanLemonade / LLM-Misinfo-QA
This repository contains data and code used for On the Risk of Misinformation Pollution with Large Language Models (EMNLP 2023 Findings).
☆14Updated last year
declare-lab / resta
Restore safety in fine-tuned language models through task arithmetic
☆28Updated last year
joeljang / knowledge-unlearning
[ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models
☆82Updated 10 months ago
cookielee77 / CLARE
Contextualized Perturbation for Textual Adversarial Attack, NAACL 2021
☆43Updated 4 years ago
Princeton-SysML / kNNLM_privacy
Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888
☆36Updated last year