sylinrl / TruthfulQALinks

TruthfulQA: Measuring How Models Imitate Human Falsehoods

☆786

Alternatives and similar repositories for TruthfulQA

Users that are interested in TruthfulQA are comparing it to the libraries listed below

Sorting:

RUCAIBox / HaluEval
This is the repository of HaluEval, a large-scale hallucination evaluation benchmark for Large Language Models.
☆497Updated last year
likenneth / honest_llama
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
☆540Updated 6 months ago
declare-lab / instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
☆547Updated last year
suzgunmirac / BIG-Bench-Hard
Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them
☆506Updated last year
glgh / awesome-llm-human-preference-datasets
A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.
☆372Updated last year
potsawee / selfcheckgpt
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models
☆549Updated last year
hendrycks / test
Measuring Massive Multitask Language Understanding | ICLR 2021
☆1,464Updated 2 years ago
nelson-liu / lost-in-the-middle
Code and data for "Lost in the Middle: How Language Models Use Long Contexts"
☆354Updated last year
allenai / natural-instructions
Expanding natural instructions
☆1,011Updated last year
shmsw25 / FActScore
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic…
☆366Updated 3 months ago
kmeng01 / rome
Locating and editing factual associations in GPT (NeurIPS 2022)
☆653Updated last year
tatsu-lab / alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
☆1,816Updated 7 months ago
SinclairCoder / Instruction-Tuning-Papers
Reading list of Instruction-tuning. A trend starts from Natrural-Instruction (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).
☆769Updated 2 years ago
GaryYufei / AlignLLMHumanSurvey
Aligning Large Language Models with Human: A Survey
☆731Updated last year
Shark-NLP / OpenICL
OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.
☆569Updated last year
andyzoujm / representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
☆854Updated 11 months ago
google-research / prompt-tuning
Original Implementation of Prompt Tuning from Lester, et al, 2021
☆689Updated 5 months ago
anthropics / hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
☆1,770Updated last month
allenai / real-toxicity-prompts
☆216Updated 4 years ago
bigcode-project / bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
☆971Updated 2 weeks ago
tjunlp-lab / Awesome-LLMs-Evaluation-Papers
The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.
☆781Updated last year
kojima-takeshi188 / zero_shot_cot
Prod Env
☆424Updated last year
voidism / DoLa
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
☆504Updated 6 months ago
jianzhnie / awesome-instruction-datasets
A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。
☆688Updated last year
yuchenlin / LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…
☆956Updated 9 months ago
ruixiangcui / AGIEval
☆758Updated last year
madaan / self-refine
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
☆719Updated 10 months ago
RenzeLou / awesome-instruction-learning
Papers and Datasets on Instruction Tuning and Following. ✨✨✨
☆498Updated last year
kmeng01 / memit
Mass-editing thousands of facts into a transformer memory (ICLR 2023)
☆510Updated last year
tatsu-lab / alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
☆821Updated last year