mjy1111 / BAKELinks

This is the repository for our paper: Untying the Reversal Curse via Bidirectional Language Model Editing

☆11

Alternatives and similar repositories for BAKE

Users that are interested in BAKE are comparing it to the libraries listed below

Sorting:

D2I-ai / eigenscore
☆30Updated 7 months ago
zepingyu0512 / awesome-SAE
awesome SAE papers
☆39Updated last month
oneal2000 / MIND
Source code of our paper MIND, ACL 2024 Long Paper
☆43Updated last year
lancopku / label-words-are-anchors
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
☆165Updated last year
zhenyu-02 / LogitLens4LLMs
A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…
☆93Updated 5 months ago
RUCAIBox / Language-Specific-Neurons
☆75Updated 6 months ago
cooperleong00 / Awesome-LLM-Interpretability
A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..
☆258Updated 3 months ago
au-revoir / model-editing-ft
☆13Updated 10 months ago
mlwu22 / RED
Implementation code for ACL2024：Advancing Parameter Efficiency in Fine-tuning via Representation Editing
☆14Updated last year
pillowsofwind / Knowledge-Conflicts-Survey
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
☆127Updated 9 months ago
fanqiwan / Explore-Instruct
EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration
☆36Updated last year
Hongcheng-Gao / Awesome-Long2short-on-LRMs
Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…
☆235Updated last month
Arvid-pku / ATOKE
[AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model
☆13Updated last year
ZNLP / Language-Imbalance-Driven-Rewarding
[ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving
☆19Updated 8 months ago
ZubinGou / math-evaluation-harness
A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨
☆231Updated last year
princeton-nlp / MQuAKE
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
☆114Updated 10 months ago
BugMakerzzz / toxic_cot
☆11Updated 4 months ago
Jometeorie / probing_llama
☆17Updated last year
MikaStars39 / FeatureAlignment
FeatureAlignment = Alignment + Mechanistic Interpretability
☆28Updated 4 months ago
zepingyu0512 / neuron-attribution
code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models
☆37Updated 8 months ago
GAIR-NLP / alignment-for-honesty
☆74Updated last year
junzhuang-code / LLMSurveySummary
A collection of survey papers and resources related to Large Language Models (LLMs).
☆40Updated last year
pkunlp-icler / IKE
☆24Updated 2 years ago
LuckyyySTA / Awesome-LLM-hallucination
LLM hallucination paper list
☆319Updated last year
zzhang0179 / Unveiling-Linguistic-Regions-in-LLMs
[ACL 2024] Unveiling Linguistic Regions in Large Language Models
☆31Updated last year
Zhaoyi-Li21 / creme
[ACL'2024 Findings] "Understanding and Patching Compositional Reasoning in LLMs"
☆12Updated 10 months ago
getao / icae
The repo for In-context Autoencoder
☆129Updated last year
SeekingDream / Static-to-Dynamic-LLMEval
☆29Updated 3 weeks ago
THUNLP-MT / PromptGating4MCTG
This is the repo for our work “An Extensible Plug-and-Play Method for Multi-Aspect Controllable Text Generation” (ACL 2023).
☆13Updated last year
yuzhaouoe / SAE-based-representation-engineering
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆61Updated 7 months ago