YichenZW / Robust-DetLinks
The code implementation of the paper Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks (ACL 2024 main) by Yichen Wang, Shangbin Feng, Abe Bohan Hou, Xiao Pu, Chao Shen, Xiaoming Liu, and Yulia Tsvetkov, and Tianxing He, mainly at Paul G. Allen School of CSE, University of Washington.
☆13Updated last year
Alternatives and similar repositories for Robust-Det
Users that are interested in Robust-Det are comparing it to the libraries listed below
Sorting:
- ☆88Updated 11 months ago
- ☆69Updated last year
- This repository contains data and code used for On the Risk of Misinformation Pollution with Large Language Models (EMNLP 2023 Findings).☆15Updated last year
- [EMNLP 2023] Explainable Claim Verification via Knowledge-Grounded Reasoning with Large Language Models☆25Updated last year
- ☆38Updated 11 months ago
- ☆53Updated last year
- The lastest paper about detection of LLM-generated text and code☆281Updated 5 months ago
- ☆20Updated last year
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models☆47Updated last year
- ☆27Updated 2 years ago
- ☆47Updated last year
- Paper list for the survey "Combating Misinformation in the Age of LLMs: Opportunities and Challenges" and the initiative "LLMs Meet Misin…☆103Updated last year
- AbstainQA, ACL 2024☆28Updated last year
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆148Updated last year
- Source code of our paper MIND, ACL 2024 Long Paper☆57Updated 2 weeks ago
- Code for the ACL 2023 Paper "Fact-Checking Complex Claims with Program-Guided Reasoning"☆57Updated 2 years ago
- Safety-J: Evaluating Safety with Critique☆16Updated last year
- [ACL'2024 Findings] "Understanding and Patching Compositional Reasoning in LLMs"☆13Updated last year
- ☆11Updated last year
- awesome SAE papers☆60Updated 6 months ago
- [AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model☆14Updated last year
- Source code of ACL 2023 accepted paper "AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression"☆12Updated 2 years ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆119Updated last year
- Official Code for EMNLP2023 Main Conference paper: "KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detec…☆30Updated 2 years ago
- Official code for the paper Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception. The code is based on t…☆19Updated 3 months ago
- The implement of paper:"ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability"☆50Updated 5 months ago
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆130Updated 3 months ago
- ☆28Updated last year
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…☆60Updated 6 months ago
- [NeurIPS 2024 D&B] DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios☆40Updated 11 months ago