Zhaoyi-Li21 / creme

[ACL'2024 Findings] "Understanding and Patching Compositional Reasoning in LLMs"

☆12

Alternatives and similar repositories for creme

Users who are interested in creme compare it to the libraries listed below.

D2I-ai / eigenscore
☆38 Updated 11 months ago
pillowsofwind / Knowledge-Conflicts-Survey
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
☆148 Updated last year
zepingyu0512 / awesome-SAE
awesome SAE papers
☆59 Updated 5 months ago
Jometeorie / probing_llama
☆17 Updated last year
lancopku / label-words-are-anchors
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
☆165 Updated last year
princeton-nlp / MQuAKE
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
☆118 Updated last year
hrwise-nlp / ToolsMeetLLMs
☆31 Updated 6 months ago
AmourWaltz / Reliable-LLM
☆170 Updated last year
zepingyu0512 / neuron-attribution
code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models
☆47 Updated last year
lorenzkuhn / semantic_uncertainty
☆179 Updated last year
xhan77 / context-aware-decoding
☆53 Updated last year
pkunlp-icler / IKE
☆25 Updated 2 years ago
OSU-NLP-Group / LLM-Knowledge-Conflict
[ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"
☆77 Updated last year
Alsace08 / Chain-of-Embedding
[ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"
☆83 Updated 11 months ago
RUCAIBox / HaluEval-2.0
☆47 Updated last year
au-revoir / model-editing-ft
☆13 Updated last year
yuzhaouoe / SAE-based-representation-engineering
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆67 Updated 11 months ago
Jeryi-Sun / ReDEeP-ICLR
The implementation of the paper "ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability"
☆50 Updated 5 months ago
zhenyu-02 / LogitLens4LLMs
A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…
☆127 Updated 3 months ago
zhaochen0110 / conflictbank
Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…
☆58 Updated 6 months ago
YJiangcm / LTE
[ACL 2024] Learning to Edit: Aligning LLMs with Knowledge Editing
☆36 Updated last year
LuckyyySTA / Awesome-LLM-hallucination
LLM hallucination paper list
☆324 Updated last year
zepingyu0512 / awesome-LLM-neuron
☆32 Updated 5 months ago
BugMakerzzz / toxic_cot
☆12 Updated 8 months ago
wangcunxiang / LLM-Factuality-Survey
The repository for the survey paper "Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity"
☆339 Updated last year
AGI-Edgerunners / LLM-Continual-Learning-Papers
Must-read Papers on Large Language Model (LLM) Continual Learning
☆148 Updated 2 years ago
yuyq18 / StepTool
☆31 Updated 6 months ago
jinzhuoran / RWKU
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024
☆86 Updated last year
Hunter-DDM / knowledge-neurons
Code for the ACL-2022 paper "Knowledge Neurons in Pretrained Transformers"
☆173 Updated last year
yubol-bobo / Awesome-Multi-Turn-LLMs
This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …
☆139 Updated 6 months ago