BunsenFeng / FactKBLinks
Code for "FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge". EMNLP 2023.
☆18Updated last year
Alternatives and similar repositories for FactKB
Users that are interested in FactKB are comparing it to the libraries listed below
Sorting:
- ✨ Resolving Knowledge Conflicts in Large Language Models, COLM 2024☆17Updated last week
- AbstainQA, ACL 2024☆25Updated 7 months ago
- Code and data for paper "Context-faithful Prompting for Large Language Models".☆40Updated 2 years ago
- PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance☆14Updated last year
- Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support…☆36Updated last year
- Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.☆21Updated 2 years ago
- Few-shot Learning with Auxiliary Data☆28Updated last year
- Code and Data for NeurIPS2021 Paper "A Dataset for Answering Time-Sensitive Questions"☆72Updated 3 years ago
- ☆21Updated 2 years ago
- Adding new tasks to T0 without catastrophic forgetting☆33Updated 2 years ago
- ☆86Updated 2 years ago
- ☆44Updated 9 months ago
- Evaluate the Quality of Critique☆35Updated last year
- ☆26Updated last year
- ☆36Updated last year
- Evaluation on Logical Reasoning and Abstract Reasoning Challenges☆27Updated last month
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod…☆37Updated last year
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆60Updated 2 years ago
- Tasks for describing differences between text distributions.☆16Updated 9 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆44Updated 11 months ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆46Updated last year
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆37Updated 3 months ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆37Updated last year
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval☆50Updated 7 months ago
- [EMNLP-2022 Findings] Code for paper “ProGen: Progressive Zero-shot Dataset Generation via In-context Feedback”.☆26Updated 2 years ago
- Complexity Based Prompting for Multi-Step Reasoning☆17Updated 2 years ago
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/☆24Updated 2 months ago
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Updated 2 years ago
- Data and code for ACL 2022 paper "MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data"☆44Updated 7 months ago
- WikiWhy is a new benchmark for evaluating LLMs' ability to explain between cause-effect relationships. It is a QA dataset containing 9000…☆47Updated last year