YichenZW / Robust-Det
The code implementation of the paper Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks (ACL 2024 main) by Yichen Wang, Shangbin Feng, Abe Bohan Hou, Xiao Pu, Chao Shen, Xiaoming Liu, and Yulia Tsvetkov, and Tianxing He, mainly at Paul G. Allen School of CSE, University of Washington.
☆11Updated 8 months ago
Alternatives and similar repositories for Robust-Det:
Users that are interested in Robust-Det are comparing it to the libraries listed below
- AbstainQA, ACL 2024☆25Updated 5 months ago
- ☆19Updated last year
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆58Updated last year
- ✨ Resolving Knowledge Conflicts in Large Language Models, COLM 2024☆15Updated 5 months ago
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆66Updated 2 years ago
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆52Updated 4 months ago
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models☆29Updated 4 months ago
- Code for the ACL 2023 Paper "Fact-Checking Complex Claims with Program-Guided Reasoning"☆52Updated last year
- ☆40Updated 11 months ago
- Augmenting Statistical Models with Natural Language Parameters☆23Updated 6 months ago
- ☆25Updated 6 months ago
- ☆41Updated last year
- LoFiT: Localized Fine-tuning on LLM Representations☆34Updated 2 months ago
- ☆39Updated 4 months ago
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/☆21Updated 2 weeks ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆37Updated last year
- [EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"☆19Updated 5 months ago
- ☆17Updated last year
- ☆68Updated 3 months ago
- Official code implementation of SKU, Accepted by ACL 2024 Findings☆13Updated 3 months ago
- Retrieval-Enhanced Context-Aware Prefix Encoder for Personalized Dialogue Response Generation☆17Updated last year
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆58Updated 2 years ago
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.☆98Updated 2 years ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆110Updated 6 months ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆54Updated 11 months ago
- ☆29Updated 11 months ago
- Code for the ACL-2022 paper "Knowledge Neurons in Pretrained Transformers"☆165Updated 10 months ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"☆63Updated last year
- ☆31Updated last month
- Code and data for the FACTOR paper☆44Updated last year