[NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evaluations".
☆36Jun 8, 2023Updated 2 years ago
Alternatives and similar repositories for OOD_NLP
Users who are interested in OOD_NLP are comparing it to the repositories listed below
- The source code of "Empowering Language Understanding with Counterfactual Reasoning" (ACL'21)☆11Sep 3, 2021Updated 4 years ago
- ☆10Aug 10, 2024Updated last year
- Code for ACL 2023 paper "A Close Look into the Calibration of Pre-trained Language Models"☆11May 9, 2023Updated 2 years ago
- Host CIFAR-10.2 Data Set☆13Sep 22, 2021Updated 4 years ago
- Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…☆13Jan 26, 2025Updated last year
- The official implementation of AAAI'24 paper: Self-Interpretable Graph Learning with Sufficient and Necessary Explanations.☆15Jan 29, 2024Updated 2 years ago
- ☆16Nov 26, 2024Updated last year
- Masking tokens to modify the predictions of a pretrained sentence classifier☆16Feb 4, 2020Updated 6 years ago
- ☆13Oct 20, 2022Updated 3 years ago
- ☆38Jul 13, 2022Updated 3 years ago
- CSCW 2023 Best Demo Award: Conversational AI Explanations to Support Human-AI Scientific Writing☆14Jun 25, 2023Updated 2 years ago
- Code for the ACL2022 paper "Synthetic Question Value Estimation for Domain Adaptation of Question Answering"☆17Mar 21, 2022Updated 3 years ago
- Group-conditional DRO to alleviate spurious correlations☆15Jul 15, 2021Updated 4 years ago
- ☆17Jul 6, 2020Updated 5 years ago
- ☆44Oct 30, 2025Updated 4 months ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)☆47Jan 21, 2025Updated last year
- MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning☆134Jun 20, 2023Updated 2 years ago
- Multi-task modelling extensions for huggingface transformers☆21Mar 3, 2023Updated 3 years ago
- ☆26Nov 21, 2022Updated 3 years ago
- [NAACL 2022] "SemAttack: Natural Textual Attacks via Different Semantic Spaces" by Boxin Wang, Chejian Xu, Xiangyu Liu, Yu Cheng, Bo Li☆21Jun 11, 2022Updated 3 years ago
- NeurIPS'24 - LLM Safety Landscape☆39Oct 21, 2025Updated 4 months ago
- ☆27Mar 21, 2024Updated last year
- ☆23Jun 15, 2022Updated 3 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- [ICLR'22] Self-supervised learning optimally robust representations for domain shift.☆25Feb 2, 2022Updated 4 years ago
- Code for the ICLR 2020 Paper, "A Theory of Usable Information under Computational Constraints"☆30Jul 8, 2020Updated 5 years ago
- ☆31Jun 12, 2023Updated 2 years ago
- A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)☆27Sep 12, 2021Updated 4 years ago
- Repository for the Bias Benchmark for QA dataset.☆138Jan 8, 2024Updated 2 years ago
- Examples of App of Apps Pattern☆10Jan 17, 2023Updated 3 years ago
- ☆32May 24, 2023Updated 2 years ago
- HANNA, a large annotated dataset of Human-ANnotated NArratives for ASG evaluation.☆35Oct 15, 2024Updated last year
- Automated Benchmarking of LLM Agents on Real-World Software Security Tasks [NeurIPS 2025]☆56Jan 27, 2026Updated last month
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆137Mar 14, 2024Updated last year
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha…☆133Jun 4, 2024Updated last year
- Alluxio notes, organized my own way (work in progress)☆11May 13, 2019Updated 6 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- Beyond Accuracy: What Matters in Designing Well-Behaved Models?☆18Updated this week
- ☆12Aug 2, 2024Updated last year