YichenZW / Robust-DetLinks
The code implementation of the paper Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks (ACL 2024 main) by Yichen Wang, Shangbin Feng, Abe Bohan Hou, Xiao Pu, Chao Shen, Xiaoming Liu, and Yulia Tsvetkov, and Tianxing He, mainly at Paul G. Allen School of CSE, University of Washington.
☆13Updated last year
Alternatives and similar repositories for Robust-Det
Users that are interested in Robust-Det are comparing it to the libraries listed below
Sorting:
- ☆91Updated last year
- ☆20Updated 2 years ago
- ☆71Updated last year
- ☆41Updated last year
- This repository contains data and code used for On the Risk of Misinformation Pollution with Large Language Models (EMNLP 2023 Findings).☆16Updated 2 years ago
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models☆50Updated last year
- Safety-J: Evaluating Safety with Critique☆16Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆151Updated last year
- ☆57Updated last year
- [EMNLP 2023] Explainable Claim Verification via Knowledge-Grounded Reasoning with Large Language Models☆26Updated 2 years ago
- ☆48Updated 2 years ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆119Updated last year
- Code for the arxiv paper: Complex Claim Verification with Evidence Retrieved in the Wild☆13Updated 2 years ago
- The implement of paper:"ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability"☆59Updated 8 months ago
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆72Updated 3 weeks ago
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆71Updated 3 years ago
- ☆25Updated 2 years ago
- AbstainQA, ACL 2024☆28Updated this week
- awesome SAE papers☆71Updated 8 months ago
- ☆13Updated 2 years ago
- Code for the ACL 2023 Paper "Fact-Checking Complex Claims with Program-Guided Reasoning"☆57Updated 2 years ago
- ☆37Updated 2 years ago
- The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"☆34Updated last year
- Code for the ACL-2022 paper "Knowledge Neurons in Pretrained Transformers"☆173Updated last year
- ☆38Updated 2 years ago
- ☆184Updated last year
- [ACL'2024 Findings] "Understanding and Patching Compositional Reasoning in LLMs"☆13Updated last year
- [AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model☆14Updated 2 years ago
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…☆64Updated 8 months ago
- The dataset and code for the ICLR 2024 paper "Can LLM-Generated Misinformation Be Detected?"☆80Updated last year