YichenZW / Robust-Det
The code implementation of the paper Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks (ACL 2024 main) by Yichen Wang, Shangbin Feng, Abe Bohan Hou, Xiao Pu, Chao Shen, Xiaoming Liu, and Yulia Tsvetkov, and Tianxing He, mainly at Paul G. Allen School of CSE, University of Washington.
☆11Updated 7 months ago
Alternatives and similar repositories for Robust-Det:
Users that are interested in Robust-Det are comparing it to the libraries listed below
- AbstainQA, ACL 2024☆25Updated 4 months ago
- ☆19Updated last year
- Text generation using language models with multiple exit heads☆15Updated 2 weeks ago
- ☆38Updated last year
- Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases".☆21Updated last year
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆57Updated last year
- [NAACL'25] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆47Updated 2 months ago
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆66Updated 2 years ago
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models☆28Updated 3 months ago
- Evaluate the Quality of Critique☆35Updated 8 months ago
- Retrieval-Enhanced Context-Aware Prefix Encoder for Personalized Dialogue Response Generation☆16Updated last year
- ☆35Updated 3 months ago
- Models, data, and codes for the paper: MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models☆18Updated 4 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆104Updated 10 months ago
- ☆30Updated 9 months ago
- LoFiT: Localized Fine-tuning on LLM Representations☆33Updated last month
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆63Updated 11 months ago
- Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge☆12Updated last year
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆106Updated 5 months ago
- Augmenting Statistical Models with Natural Language Parameters☆23Updated 5 months ago
- Code and data for the FACTOR paper☆44Updated last year
- ☆25Updated 5 months ago
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.☆97Updated last year
- Code for the ACL 2023 Paper "Fact-Checking Complex Claims with Program-Guided Reasoning"☆51Updated last year
- Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs.☆46Updated last year
- Safety-J: Evaluating Safety with Critique☆16Updated 6 months ago
- ☆47Updated last year
- ☆40Updated last year
- [EMNLP 2023] Explainable Claim Verification via Knowledge-Grounded Reasoning with Large Language Models☆22Updated last year
- ☆40Updated 9 months ago