D2I-ai / eigenscoreLinks

☆39

Alternatives and similar repositories for eigenscore

Users that are interested in eigenscore are comparing it to the libraries listed below

Sorting:

zepingyu0512 / awesome-SAE
awesome SAE papers
☆69Updated 7 months ago
lorenzkuhn / semantic_uncertainty
☆181Updated last year
Zhaoyi-Li21 / creme
[ACL'2024 Findings] "Understanding and Patching Compositional Reasoning in LLMs"
☆13Updated last year
Jometeorie / probing_llama
☆17Updated last year
xhan77 / context-aware-decoding
☆54Updated last year
pkunlp-icler / IKE
☆25Updated 2 years ago
zepingyu0512 / neuron-attribution
code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models
☆48Updated last year
pillowsofwind / Knowledge-Conflicts-Survey
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
☆150Updated last year
lancopku / label-words-are-anchors
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
☆168Updated last year
princeton-nlp / MQuAKE
[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
☆118Updated last year
Arvid-pku / ATOKE
[AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model
☆14Updated 2 years ago
AmourWaltz / Reliable-LLM
☆178Updated last year
RUCAIBox / HaluEval-2.0
☆48Updated last year
BugMakerzzz / toxic_cot
☆12Updated 9 months ago
oneal2000 / MIND
Source code of our paper MIND, ACL 2024 Long Paper
☆59Updated last month
YJiangcm / LTE
[ACL 2024] Learning to Edit: Aligning LLMs with Knowledge Editing
☆36Updated last year
zhenyu-02 / LogitLens4LLMs
A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…
☆140Updated 4 months ago
yuzhaouoe / SAE-based-representation-engineering
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆68Updated last year
Jeryi-Sun / ReDEeP-ICLR
The implement of paper:"ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability"
☆54Updated 6 months ago
jinzhuoran / RWKU
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024
☆86Updated last year
nusnlp / FSPO
Official code for our paper "Reasoning Models Hallucinate More: Factuality-Aware Reinforcement Learning for Large Reasoning Models"
☆20Updated last month
AGI-Edgerunners / LLM-Continual-Learning-Papers
Must-read Papers on Large Language Model (LLM) Continual Learning
☆148Updated 2 years ago
alon-albalak / data-selection-survey
A Survey on Data Selection for Language Models
☆253Updated 7 months ago
deeplearning-wisc / picle
Official code for ICML 2024 paper on Persona In-Context Learning (PICLe)
☆26Updated last year
ZFancy / awesome-activation-engineering
A curated list of resources for activation engineering
☆119Updated 2 months ago
CaoYuanpu / BiPO
Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization
☆39Updated last year
zhiyuanhubj / UoT
[NeurIPS 2024] Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models
☆105Updated last year
kkkevinkkkkk / situated_faithfulness
☆14Updated last year
RUCAIBox / Language-Specific-Neurons
☆89Updated last year
kevinyaobytedance / llm_unlearn
LLM Unlearning
☆178Updated 2 years ago