YichenZW / Robust-Det
The code implementation of the paper "Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks" (ACL 2024 main) by Yichen Wang, Shangbin Feng, Abe Bohan Hou, Xiao Pu, Chao Shen, Xiaoming Liu, Yulia Tsvetkov, and Tianxing He, mainly at the Paul G. Allen School of CSE, University of Washington.
☆12 Updated 11 months ago
Alternatives and similar repositories for Robust-Det
Users who are interested in Robust-Det are comparing it to the repositories listed below.
- AbstainQA, ACL 2024 ☆26 Updated 8 months ago
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model ☆68 Updated 2 years ago
- Text generation using language models with multiple exit heads ☆16 Updated last month
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering ☆60 Updated 7 months ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations" ☆65 Updated last year
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023) ☆59 Updated last year
- ☆26 Updated 9 months ago
- ☆19 Updated last year
- Resolving Knowledge Conflicts in Large Language Models, COLM 2024 ☆17 Updated last week
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models ☆35 Updated 7 months ago
- ConceptVectors Benchmark and Code for the paper "Intrinsic Evaluation of Unlearning Using Parametric Knowledge Traces" ☆35 Updated 4 months ago
- ☆29 Updated last year
- ☆31 Updated 7 months ago
- Augmenting Statistical Models with Natural Language Parameters ☆27 Updated 9 months ago
- ☆44 Updated 9 months ago
- ☆44 Updated 7 months ago
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/ ☆24 Updated 3 months ago
- Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs" ☆27 Updated this week
- ☆44 Updated last year
- Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs. ☆49 Updated last year
- Official code implementation of SKU, Accepted by ACL 2024 Findings ☆15 Updated 6 months ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975 ☆38 Updated last year
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs). ☆57 Updated 10 months ago
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue ☆35 Updated last month
- (NAACL 2024) Official code repository for Mixset. ☆25 Updated 6 months ago
- Codes for ACL 2023 Paper "Fact-Checking Complex Claims with Program-Guided Reasoning" ☆31 Updated 2 years ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024) ☆59 Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024) ☆112 Updated last year
- ☆75 Updated 6 months ago
- Code for the ACL 2023 Paper "Fact-Checking Complex Claims with Program-Guided Reasoning" ☆56 Updated last year