AbhilashaRavichander / HALoGENLinks

Code for the paper "HALoGEN: Fantastic LLM Hallucinations and Where To Find Them"

☆23

Alternatives and similar repositories for HALoGEN

Users that are interested in HALoGEN are comparing it to the libraries listed below

Sorting:

HillZhang1999 / ICD
Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"
☆69Updated last year
yuzhaouoe / SAE-based-representation-engineering
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆67Updated last year
Nanami18 / Snowballed_Hallucination
☆44Updated last year
hkust-nlp / felm
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆61Updated last year
eric-mitchell / serac
Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model
☆69Updated 3 years ago
zthang / Focus
☆21Updated last year
dannyallover / overthinking_the_truth
☆29Updated last year
princeton-nlp / MQuAKE
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
☆119Updated last year
epfl-dlab / llm-latent-language
Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".
☆80Updated last year
launchnlp / LitCab
☆25Updated 5 months ago
Zce1112zslx / IKE
☆41Updated 2 years ago
balevinstein / Probes
☆57Updated 2 years ago
RUCAIBox / Language-Specific-Neurons
☆87Updated 11 months ago
GAIR-NLP / alignment-for-honesty
☆76Updated last year
declare-lab / resta
Restore safety in fine-tuned language models through task arithmetic
☆29Updated last year
GAIR-NLP / BeHonest
BeHonest: Benchmarking Honesty in Large Language Models
☆34Updated last year
OSU-NLP-Group / LLM-Knowledge-Conflict
[ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"
☆77Updated last year
roeehendel / icl_task_vectors
☆101Updated 2 years ago
nayeon7lee / FactualityPrompt
☆86Updated 3 years ago
xhan77 / context-aware-decoding
☆53Updated last year
zepingyu0512 / neuron-attribution
code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models
☆47Updated last year
allenai / noncompliance
This repository contains data, code and models for contextual noncompliance.
☆24Updated last year
ehsk / OpenQA-eval
ACL 2023: Evaluating Open-Domain Question Answering in the Era of Large Language Models
☆47Updated last year
qtli / GSM-Plus
GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.
☆63Updated last year
AI21Labs / factor
Code and data for the FACTOR paper
☆52Updated 2 years ago
edenbiran / HoppingTooLate
Exploring the Limitations of Large Language Models on Multi-Hop Queries
☆27Updated 8 months ago
SumilerGAO / SunGen
☆27Updated 2 years ago
SeaEval / SeaEval
NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning
☆26Updated 8 months ago
SALT-NLP / CultureBank
☆46Updated 2 months ago
BunsenFeng / AbstainQA
AbstainQA, ACL 2024
☆28Updated last year