LinxinS97 / NLPBench
NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models
☆10 Updated last year
Related projects
Alternatives and complementary repositories for NLPBench
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models" ☆51 Updated last year
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language ☆42 Updated last year
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”. ☆24 Updated last year
- ☆33 Updated 3 years ago
- This repository contains data, code and models for contextual noncompliance. ☆18 Updated 4 months ago
- Code associated with the paper: "Few-Shot Self-Rationalization with Natural Language Prompts" ☆13 Updated 2 years ago
- Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support… ☆35 Updated last year
- The codes for our ACL'22 paper: PRBOOST: Prompt-Based Rule Discovery and Boosting for Interactive Weakly-Supervised Learning. ☆34 Updated 2 years ago
- ☆40 Updated 11 months ago
- Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022. ☆20 Updated 2 years ago
- ☆48 Updated last year
- Findings of ACL'2023: Optimizing Test-Time Query Representations for Dense Retrieval ☆29 Updated last year
- Code and data for paper "Context-faithful Prompting for Large Language Models". ☆39 Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643 ☆69 Updated last year
- ☆42 Updated 10 months ago
- A framework to train language models to learn invariant representations. ☆12 Updated 2 years ago
- ☆36 Updated 7 months ago
- Repository of paper "LLMs with Chain-of-Thought Are Non-Causal Reasoners" ☆15 Updated 7 months ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se… ☆55 Updated last year
- Methods and evaluation for aligning language models temporally ☆24 Updated 8 months ago
- ☆44 Updated 2 months ago
- Code for the paper "Learning Variational Word Masks to Improve the Interpretability of Neural Text Classifiers" ☆17 Updated 3 years ago
- TBC ☆26 Updated 2 years ago
- Restore safety in fine-tuned language models through task arithmetic ☆26 Updated 7 months ago
- [NeurIPS 2023 D&B Track] Code and data for paper "Revisiting Out-of-distribution Robustness in NLP: Benchmarks, Analysis, and LLMs Evalua… ☆29 Updated last year
- ☆24 Updated last year
- AbstainQA, ACL 2024 ☆19 Updated last month
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning ☆22 Updated last year
- Code for the ACL2022 paper "Synthetic Question Value Estimation for Domain Adaptation of Question Answering" ☆16 Updated 2 years ago
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles". ☆14 Updated last month