YichenZW / Robust-DetLinks

The code implementation of the paper Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks (ACL 2024 main) by Yichen Wang, Shangbin Feng, Abe Bohan Hou, Xiao Pu, Chao Shen, Xiaoming Liu, and Yulia Tsvetkov, and Tianxing He, mainly at Paul G. Allen School of CSE, University of Washington.

☆12

Alternatives and similar repositories for Robust-Det

Users that are interested in Robust-Det are comparing it to the libraries listed below

Sorting:

MichSchli / AVeriTeC
☆65Updated 10 months ago
RUCAIBox / Language-Specific-Neurons
☆85Updated 9 months ago
zepingyu0512 / neuron-attribution
code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models
☆44Updated 11 months ago
MexicanLemonade / LLM-Misinfo-QA
This repository contains data and code used for On the Risk of Misinformation Pollution with Large Language Models (EMNLP 2023 Findings).
☆15Updated last year
wang2226 / FOLK
[EMNLP 2023] Explainable Claim Verification via Knowledge-Grounded Reasoning with Large Language Models
☆24Updated last year
thcheung / FactLLaMA
[APSIPA ASC 2023] The official code of paper, "FactLLaMA: Optimizing Instruction-Following Language Models with External Knowledge for Au…
☆18Updated last year
RUCAIBox / HaluEval-2.0
☆47Updated last year
mbzuai-nlp / ProgramFC
Code for the ACL 2023 Paper "Fact-Checking Complex Claims with Program-Guided Reasoning"
☆56Updated 2 years ago
pillowsofwind / Knowledge-Conflicts-Survey
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
☆139Updated last year
pkunlp-icler / IKE
☆26Updated 2 years ago
OpenLMLab / Sniffer
☆26Updated 2 years ago
D2I-ai / eigenscore
☆38Updated 10 months ago
SALT-NLP / Efficient_Unlearning
☆38Updated 2 years ago
princeton-nlp / MQuAKE
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
☆116Updated last year
xhan77 / context-aware-decoding
☆51Updated 11 months ago
cloudygoose / blindspot_nlg
☆20Updated last year
SALT-NLP / chain-of-thought-bias
☆28Updated last year
BunsenFeng / AbstainQA
AbstainQA, ACL 2024
☆28Updated last year
llm-misinformation / llm-misinformation-survey
Paper list for the survey "Combating Misinformation in the Age of LLMs: Opportunities and Challenges" and the initiative "LLMs Meet Misin…
☆103Updated 11 months ago
i-gallegos / Fair-LLM-Benchmark
☆152Updated 2 years ago
penguinnnnn / awesome-llm-and-society
Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs.
☆50Updated last year
thunlp / LLM-generated-text-detection
☆14Updated last year
zepingyu0512 / awesome-SAE
awesome SAE papers
☆48Updated 4 months ago
Complex-data / MUSER
☆18Updated last year
Xianjun-Yang / Awesome_papers_on_LLMs_detection
The lastest paper about detection of LLM-generated text and code
☆278Updated 4 months ago
Hunter-DDM / knowledge-neurons
Code for the ACL-2022 paper "Knowledge Neurons in Pretrained Transformers"
☆172Updated last year
Arvid-pku / ATOKE
[AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model
☆13Updated last year
llm-misinformation / llm-misinformation
The dataset and code for the ICLR 2024 paper "Can LLM-Generated Misinformation Be Detected?"
☆77Updated 11 months ago
Jometeorie / probing_llama
☆17Updated last year
mbzuai-nlp / DetectLLM
DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text
☆31Updated 2 years ago