csitfun / GLoRE

a benckmark for evaluating logical reasoning of LLMs

☆16

Related projects ⓘ

Alternatives and complementary repositories for GLoRE

GAIR-NLP / alignment-for-honesty
☆65Updated 6 months ago
RUCAIBox / HaluEval-2.0
☆36Updated 10 months ago
edenbiran / RippleEdits
Evaluating the Ripple Effects of Knowledge Editing in Language Models
☆50Updated 7 months ago
Shark-NLP / self-adaptive-ICL
self-adaptive in-context learning
☆41Updated last year
thunlp / Knowledge-Plugin
Repo for ACL2023 paper "Plug-and-Play Knowledge Injection for Pre-trained Language Models"
☆57Updated 7 months ago
RuochenZhao / Verify-and-Edit
A framework for editing the CoTs for better factuality
☆40Updated 11 months ago
yunx-z / COMBO
Merging Generated and Retrieved Knowledge for Open-Domain QA (EMNLP 2023)
☆22Updated last year
wangpf3 / consistent-CoT-distillation
☆36Updated last year
princeton-nlp / MQuAKE
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
☆102Updated 2 months ago
SparkJiao / dpo-trajectory-reasoning
[EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".
☆42Updated 2 months ago
Bolin97 / awesome-instruction-selector
Paper list and datasets for the paper: A Survey on Data Selection for LLM Instruction Tuning
☆33Updated 9 months ago
shizhediao / R-Tuning
[NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…
☆83Updated 4 months ago
SihengLi99 / LLM-Honesty-Survey
A Survey on the Honesty of Large Language Models
☆46Updated last month
GAIR-NLP / BeHonest
BeHonest: Benchmarking Honesty in Large Language Models
☆30Updated 3 months ago
OhadRubin / EPR
☆59Updated last year
ChengpengLi1003 / DotaMath
☆25Updated last month
hongbinye / Cognitive-Mirage-Hallucinations-in-LLMs
Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models"
☆46Updated last year
HKUNLP / icl-ceil
[ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.
☆92Updated last year
rookie-joe / AutoPSV
☆31Updated 3 weeks ago
c-box / KnowledgeLifecycle
Paper list of "The Life Cycle of Knowledge in Big Language Models: A Survey"
☆61Updated last year
ChaosCodes / ProPETL
One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning
☆38Updated last year
OpenMOSS / Say-I-Dont-Know
[ICML'2024] Can AI Assistants Know What They Don't Know?
☆70Updated 9 months ago
sunnweiwei / AmbigPrompt
Answering Ambiguous Questions via Iterative Prompting
☆14Updated 5 months ago
khuangaf / ZeroFEC
Official implementation of the ACL 2023 paper: "Zero-shot Faithful Factual Error Correction"
☆17Updated last year
HillZhang1999 / ICD
Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"
☆60Updated 8 months ago
pillowsofwind / Knowledge-Conflicts-Survey
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
☆86Updated 2 months ago
zhaochen0110 / conflictbank
Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…
☆28Updated last month
OSU-NLP-Group / LLM-Knowledge-Conflict
[ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"
☆61Updated 7 months ago
OpenMatch / Augmentation-Adapted-Retriever
[ACL 2023] This is the code repo for our ACL'23 paper "Augmentation-Adapted Retriever Improves Generalization of Language Models as Gener…
☆58Updated 4 months ago
Yiwei98 / TDG
☆24Updated 8 months ago