1171-jpg / BrainTeaser
☆13 Updated last year
Alternatives and similar repositories for BrainTeaser:
Users who are interested in BrainTeaser are comparing it to the libraries listed below.
- UNISUMM: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning ☆60 Updated last year
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”. ☆98 Updated 2 years ago
- [EMNLP 2022] Code for our paper “ZeroGen: Efficient Zero-shot Learning via Dataset Generation”. ☆16 Updated 3 years ago
- ☆25 Updated last year
- Benchmarking Generalization to New Tasks from Natural Language Instructions ☆26 Updated 3 years ago
- A unified benchmark for math reasoning ☆87 Updated 2 years ago
- Exploring the Limitations of Large Language Models on Multi-Hop Queries ☆24 Updated last month
- Evaluation on Logical Reasoning and Abstract Reasoning Challenges ☆25 Updated last year
- ☆48 Updated 2 years ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023) ☆58 Updated last year
- Official Code for EMNLP2023 Main Conference paper: "KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detec… ☆30 Updated last year
- [NLPCC 2022] Kformer: Knowledge Injection in Transformer Feed-Forward Layers ☆36 Updated 2 years ago
- ☆19 Updated last year
- the instructions and demonstrations for building a formal logical reasoning capable GLM ☆53 Updated 7 months ago
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000… ☆47 Updated last year
- ☆73 Updated last year
- ☆41 Updated last year
- AbstainQA, ACL 2024 ☆25 Updated 5 months ago
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model ☆66 Updated 2 years ago
- [NeurIPS 2022] Generating Training Data with Language Models: Towards Zero-Shot Language Understanding ☆64 Updated 2 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023. ☆41 Updated last year
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs). ☆51 Updated 7 months ago
- ☆18 Updated last year
- ☆25 Updated 2 years ago
- Repository for ACL'22 paper: Dynamic Latent Extraction for Abstractive Long-Input Summarization ☆55 Updated last year
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems. ☆56 Updated 8 months ago
- templates and other documents regarding responsible NLP research ☆67 Updated last year
- ☆28 Updated last year
- ☆48 Updated 11 months ago
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions" ☆67 Updated 3 years ago