kttian / llm_factuality_tuningLinks

☆39

Alternatives and similar repositories for llm_factuality_tuning

Users that are interested in llm_factuality_tuning are comparing it to the libraries listed below

Sorting:

yizhongw / llm-temporal-alignment
Methods and evaluation for aligning language models temporally
☆30Updated last year
hkust-nlp / felm
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆63Updated 2 years ago
nayeon7lee / FactualityPrompt
☆88Updated 3 years ago
swj0419 / in-context-pretraining
☆55Updated last year
google-research-datasets / GSM-IC
Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…
☆65Updated 2 years ago
AI21Labs / factor
Code and data for the FACTOR paper
☆53Updated 2 years ago
princeton-nlp / MQuAKE
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
☆118Updated last year
katiekang1998 / llm_hallucinations
☆17Updated last year
HillZhang1999 / ICD
Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"
☆69Updated last year
sunlab-osu / Understanding-CoT
☆88Updated 2 years ago
FranxYao / FlanT5-CoT-Specialization
Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning.
☆132Updated 2 years ago
cxcscmu / MATES
Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]
☆77Updated last year
dannyallover / overthinking_the_truth
☆29Updated last year
qtli / GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
☆64Updated last year
allenai / noncompliance
This repository contains data, code and models for contextual noncompliance.
☆24Updated last year
GAIR-NLP / alignment-for-honesty
☆77Updated last year
shizhediao / R-Tuning
[NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…
☆126Updated last year
Nanami18 / Snowballed_Hallucination
☆44Updated last year
ADaM-BJTU / W2SG
The code of “Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning”
☆17Updated last year
Thartvigsen / GRACE
[NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors
☆82Updated last year
BunsenFeng / AbstainQA
AbstainQA, ACL 2024
☆28Updated last year
edenbiran / RippleEdits
Evaluating the Ripple Effects of Knowledge Editing in Language Models
☆55Updated last year
ellaneeman / disent_qa
This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.
☆16Updated 2 years ago
mega002 / ff-layers
The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…
☆99Updated 4 years ago
Zce1112zslx / IKE
☆41Updated 2 years ago
Alrope123 / rethinking-demonstrations
☆177Updated last year
xiye17 / TextualExplInContext
The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)
☆16Updated 2 years ago
launchnlp / BOLT
Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases".
☆21Updated 2 years ago
FranxYao / Complexity-Based-Prompting
Complexity Based Prompting for Multi-Step Reasoning
☆17Updated 2 years ago
GXimingLu / Quark
☆75Updated 2 years ago