YichenZW / Robust-Det
Code implementation of the paper "Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks" (ACL 2024 main) by Yichen Wang, Shangbin Feng, Abe Bohan Hou, Xiao Pu, Chao Shen, Xiaoming Liu, Yulia Tsvetkov, and Tianxing He, mainly at the Paul G. Allen School of CSE, University of Washington.
☆9 · Updated 4 months ago
Related projects
Alternatives and complementary repositories for Robust-Det
- AbstainQA, ACL 2024 ☆19 · Updated last month
- ✨ Resolving Knowledge Conflicts in Large Language Models, COLM 2024 ☆15 · Updated last month
- Evaluate the Quality of Critique ☆35 · Updated 5 months ago
- Code and data for paper "Context-faithful Prompting for Large Language Models". ☆39 · Updated last year
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”. ☆92 · Updated last year
- Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering ☆26 · Updated last month
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975 ☆37 · Updated 11 months ago
- Official codebase for permutation self-consistency. ☆16 · Updated 9 months ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023) ☆54 · Updated 10 months ago
- [ACL 2024] Code for "MoPS: Modular Story Premise Synthesis for Open-Ended Automatic Story Generation" ☆31 · Updated 4 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions ☆40 · Updated 4 months ago
- Retrieval-Enhanced Context-Aware Prefix Encoder for Personalized Dialogue Response Generation ☆16 · Updated last year
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model ☆65 · Updated 2 years ago
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers". ☆58 · Updated 8 months ago
- Augmenting Statistical Models with Natural Language Parameters ☆17 · Updated 2 months ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems. ☆47 · Updated 4 months ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts" ☆61 · Updated 7 months ago
- Text generation using language models with multiple exit heads ☆15 · Updated last year
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023. ☆40 · Updated 11 months ago
- LoFiT: Localized Fine-tuning on LLM Representations ☆21 · Updated 4 months ago
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023) ☆13 · Updated last year
- First explanation metric (diagnostic report) for text generation evaluation ☆61 · Updated 4 months ago
- InstructIR, a novel benchmark specifically designed to evaluate the instruction-following ability of information retrieval models. Our foc… ☆28 · Updated 5 months ago
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ☆57 · Updated last month