zhenyu-02/LogitLens4LLMs

A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enabling layer-wise analysis of hidden states and predictions.

☆174

Alternatives and similar repositories for LogitLens4LLMs

Users who are interested in LogitLens4LLMs are comparing it to the repositories listed below.

arnab-api / Logit-Lens-Interpreting-GPT-2
☆16 · Jan 31, 2023 · Updated 3 years ago
LLM-MI-Research / Actionable-MI
☆15 · Jan 20, 2026 · Updated 6 months ago
nrimsky / LM-exp
LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces
☆105 · Sep 21, 2023 · Updated 2 years ago
epfl-dlab / llm-latent-language
Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".
☆87 · Mar 11, 2024 · Updated 2 years ago
locuslab / massive-activations
Code accompanying the paper "Massive Activations in Large Language Models"
☆202 · Mar 4, 2024 · Updated 2 years ago
AlignmentResearch / tuned-lens
Tools for understanding how transformer predictions are built layer-by-layer
☆605 · Aug 7, 2025 · Updated 11 months ago
mainlp / Multilingual-Refusal
☆16 · Nov 5, 2025 · Updated 8 months ago
VanillaCreamer / Awesome-Personalized-LLMs
The latest progress of Personalized Large Language Models (LLMs).
☆52 · Updated this week
itsqyh / Awesome-LMMs-Mechanistic-Interpretability
A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…
☆215 · Mar 4, 2026 · Updated 4 months ago
koalazf99 / nanoverl
Collections of RLxLM experiments using minimal codes
☆14 · Feb 17, 2025 · Updated last year
Zhaoyi-Li21 / creme
[ACL 2024] "Understanding and Patching Compositional Reasoning in LLMs"
☆14 · Aug 28, 2024 · Updated last year
eth-lre / LLM_ICL
ACL24
☆11 · Jun 7, 2024 · Updated 2 years ago
TransformerLensOrg / TransformerLens
A library for mechanistic interpretability of GPT-style language models
☆3,723 · Updated this week
Y-L-LIU / MGTBench-2.0
☆28 · Apr 18, 2025 · Updated last year
MLRM-Halu / MLRM-Halu
[NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models
☆82 · May 31, 2025 · Updated last year
Purshow / Awesome-LVLM-Hallucination
☆56 · Nov 26, 2024 · Updated last year
RUCKBReasoning / CoT-based-Synthesizer
Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis'
☆32 · May 19, 2025 · Updated last year
Ther-nullptr / circult-eda-mlsys-tinyml-arxiv-daily
🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)
☆10 · Updated this week
ml-researcher / VAE
☆11 · Oct 8, 2022 · Updated 3 years ago
Lyun0912-wu / LongAttn
LongAttn: Selecting Long-context Training Data via Token-level Attention
☆15 · Jul 16, 2025 · Updated last year
TaiMingLu / know-dont-tell
☆19 · Oct 14, 2024 · Updated last year
ablghtianyi / ICL_Modular_Arithmetic
☆19 · Mar 25, 2025 · Updated last year
YangLing0818 / SuperCorrect-llm
[ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
☆90 · Mar 23, 2025 · Updated last year
AmourWaltz / UAlign
Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"
☆15 · Mar 25, 2025 · Updated last year
kamanphoebe / Look-into-MoEs
[NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models
☆61 · Feb 7, 2025 · Updated last year
zepingyu0512 / awesome-llm-understanding-mechanism
awesome papers in LLM interpretability
☆624 · Aug 20, 2025 · Updated 11 months ago
dadelani / sib-200
SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects
☆26 · May 20, 2026 · Updated 2 months ago
RobertCsordas / onion_representations
☆13 · Aug 19, 2024 · Updated last year
WANGXinyiLinda / planning_tokens
Official code for Guiding Language Model Math Reasoning with Planning Tokens
☆19 · Feb 29, 2024 · Updated 2 years ago
ArjunPanickssery / self_recognition
☆10 · May 17, 2024 · Updated 2 years ago
ruizheliUOA / Awesome-Interpretability-in-Large-Language-Models
This repository collects all relevant resources about interpretability in LLMs
☆402 · Nov 1, 2024 · Updated last year
MasterVito / SwS
Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
☆42 · Nov 11, 2025 · Updated 8 months ago
nju-websoft / TransEdge
TransEdge: Translating Relation-contextualized Embeddings for Knowledge Graphs, ISWC 2019
☆28 · Dec 21, 2019 · Updated 6 years ago
lukahhcm / Awesome_Environment_Scaling
Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …
☆72 · Jan 28, 2026 · Updated 6 months ago
microsoft / RTP-LX
Repository for the paper "RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios?"
☆29 · May 1, 2025 · Updated last year
VITA-Group / READ-ME
[NeurIPS2024] "Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design", Ruisi Cai, Yeonju Ro, Geon-Woo …
☆16 · Dec 16, 2024 · Updated last year
junyangwang0410 / Attention-LLaVA
A hot-pluggable tool for visualizing LLaVA's attention.
☆24 · Jan 29, 2024 · Updated 2 years ago
zjunlp / KnowledgeCircuits
[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers
☆172 · Nov 14, 2025 · Updated 8 months ago
Karim-53 / Compare-xAI
🧪 A unified benchmark to evaluate & compare Explainable AI methods (SHAP, LIME, ...) via functional tests. Live results + paper (arXiv:2…
☆14 · Jul 3, 2026 · Updated 3 weeks ago