oneal2000 / MINDLinks

Source code of our paper MIND, ACL 2024 Long Paper

☆47

Alternatives and similar repositories for MIND

Users that are interested in MIND are comparing it to the libraries listed below

Sorting:

pillowsofwind / Knowledge-Conflicts-Survey
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
☆130Updated 10 months ago
D2I-ai / eigenscore
☆32Updated 7 months ago
OSU-NLP-Group / LLM-Knowledge-Conflict
[ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"
☆72Updated last year
princeton-nlp / MQuAKE
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
☆114Updated 10 months ago
RUCAIBox / HaluEval-2.0
☆46Updated last year
RUCAIBox / Language-Specific-Neurons
☆76Updated 7 months ago
GAIR-NLP / alignment-for-honesty
☆74Updated last year
xhan77 / context-aware-decoding
☆46Updated 8 months ago
OpenSafetyLab / SALAD-BENCH
【ACL 2024】 SALAD benchmark & MD-Judge
☆156Updated 5 months ago
zhenyu-02 / LogitLens4LLMs
A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…
☆95Updated 5 months ago
eric-mitchell / serac
Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model
☆68Updated 2 years ago
alisawuffles / proxy-tuning
Code associated with Tuning Language Models by Proxy (Liu et al., 2024)
☆114Updated last year
zepingyu0512 / neuron-attribution
code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models
☆39Updated 8 months ago
PKU-Alignment / beavertails
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
☆151Updated last year
kevinyaobytedance / llm_unlearn
LLM Unlearning
☆172Updated last year
oneal2000 / DRAGIN
Source code of DRAGIN, ACL 2024 main conference Long Paper (Oral)
☆155Updated 5 months ago
yuzhaouoe / SAE-based-representation-engineering
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆62Updated 8 months ago
Jarviswang94 / Multilingual_safety_benchmark
Multilingual safety benchmark for Large Language Models
☆52Updated 11 months ago
LuckyyySTA / Awesome-LLM-hallucination
LLM hallucination paper list
☆320Updated last year
SparkJiao / dpo-trajectory-reasoning
[EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".
☆80Updated 6 months ago
zthang / Focus
☆20Updated last year
lancopku / label-words-are-anchors
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
☆165Updated last year
Miaoranmmm / SelfChecker
codes for "Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models"
☆10Updated 5 months ago
hkust-nlp / felm
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆59Updated last year
HillZhang1999 / ICD
Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"
☆68Updated last year
weizhepei / InstructRAG
[ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales
☆112Updated 6 months ago
Hunter-DDM / knowledge-neurons
Code for the ACL-2022 paper "Knowledge Neurons in Pretrained Transformers"
☆170Updated last year
jinzhuoran / RWKU
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024
☆77Updated 10 months ago
SuperBruceJia / Awesome-LLM-Self-Consistency
Awesome LLM Self-Consistency: a curated list of Self-consistency in Large Language Models
☆105Updated 2 weeks ago
kkkevinkkkkk / situated_faithfulness
☆13Updated 9 months ago