eric11eca / disco
Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator supports GPT-3, ChatGPT, and GPT-4.
☆36Updated last year
Alternatives and similar repositories for disco:
Users that are interested in disco are comparing it to the libraries listed below
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆58Updated last year
- AbstainQA, ACL 2024☆25Updated 3 months ago
- Supporting code for ReCEval paper☆27Updated 4 months ago
- Evaluation on Logical Reasoning and Abstract Reasoning Challenges☆22Updated 11 months ago
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆39Updated last year
- ☆85Updated last year
- ☆40Updated last year
- Evaluate the Quality of Critique☆35Updated 7 months ago
- ✨ Resolving Knowledge Conflicts in Large Language Models, COLM 2024☆15Updated 3 months ago
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆63Updated 2 months ago
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆53Updated 4 months ago
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆25Updated last year
- ☆25Updated last year
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆53Updated 9 months ago
- ☆38Updated 5 months ago
- [EMNLP 2023] Knowledge Rumination for Pre-trained Language Models☆17Updated last year
- [NAACL'25] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆41Updated 2 months ago
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning☆21Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated 11 months ago
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆31Updated 7 months ago
- ☆50Updated last year
- Code for reproducing the ACL'23 paper: Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments☆73Updated last year
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆68Updated 3 weeks ago
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆66Updated 2 years ago
- Code for our paper: "GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models"☆53Updated last year
- Implementation of the paper: "Answering Questions by Meta-Reasoning over Multiple Chains of Thought"☆93Updated last year
- ☆60Updated 2 years ago
- Official codebase for permutation self-consistency.☆16Updated 11 months ago
- Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context"☆65Updated 5 months ago
- Offical code of the paper Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Le…☆70Updated 10 months ago