A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enabling layer-wise analysis of hidden states and predictions.
☆172 · Aug 14, 2025 · Updated 10 months ago
Alternatives and similar repositories for LogitLens4LLMs
Users who are interested in LogitLens4LLMs are comparing it to the libraries listed below.
- Code accompanying the paper "Massive Activations in Large Language Models" ☆200 · Mar 4, 2024 · Updated 2 years ago
- A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg… ☆205 · Mar 4, 2026 · Updated 3 months ago
- Official code for Guiding Language Model Math Reasoning with Planning Tokens ☆19 · Feb 29, 2024 · Updated 2 years ago
- Collections of RLxLM experiments using minimal code ☆14 · Feb 17, 2025 · Updated last year
- LLM experiments done during SERI MATS, focusing on activation steering / interpreting activation spaces ☆103 · Sep 21, 2023 · Updated 2 years ago
- Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis' ☆32 · May 19, 2025 · Updated last year
- ☆28 · Apr 18, 2025 · Updated last year
- 🎓 Automatically Update circult-eda-mlsys-tinyml Papers Daily using GitHub Actions (Update Every 8 Hours) ☆10 · Jun 8, 2026 · Updated last week
- ☆11 · Oct 8, 2022 · Updated 3 years ago
- Tools for understanding how transformer predictions are built layer-by-layer ☆594 · Aug 7, 2025 · Updated 10 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction ☆90 · Mar 23, 2025 · Updated last year
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models ☆16 · Jun 18, 2025 · Updated last year
- ☆19 · Mar 25, 2025 · Updated last year
- [ICML2025] Official code for "Reinforced Lifelong Editing for Language Models" ☆23 · Feb 23, 2025 · Updated last year
- Paper list for Efficient Reasoning. ☆889 · May 29, 2026 · Updated 3 weeks ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models ☆61 · Feb 7, 2025 · Updated last year
- Must-read papers and blogs about parametric knowledge mechanism in LLMs. ☆39 · May 9, 2025 · Updated last year
- [ICONIP'24] Mingyu.Jin's final year project ☆30 · Aug 23, 2024 · Updated last year
- awesome papers in LLM interpretability ☆621 · Aug 20, 2025 · Updated 9 months ago
- ☆50 · Apr 11, 2025 · Updated last year
- A hot-pluggable tool for visualizing LLaVA's attention. ☆24 · Jan 29, 2024 · Updated 2 years ago
- [NeurIPS 2024] Knowledge Circuits in Pretrained Transformers ☆170 · Nov 14, 2025 · Updated 7 months ago
- SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects ☆24 · May 20, 2026 · Updated 3 weeks ago
- Interpreting the latent space representations of attention head outputs for LLMs ☆39 · Aug 13, 2024 · Updated last year
- Utilities for the HuggingFace transformers library ☆76 · Jan 21, 2023 · Updated 3 years ago
- ☆80 · May 23, 2026 · Updated 3 weeks ago
- Deep Learning Basic Tutorial (Pytorch, Keras) ☆17 · Nov 8, 2019 · Updated 6 years ago
- [ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs. ☆2,846 · Jun 11, 2026 · Updated last week
- [ICML 2026] Elastic Diffusion Transformer: Accelerating SOTA generation models (e.g., Qwen-Image, Hunyuan3d) through adaptive computatio… ☆44 · May 1, 2026 · Updated last month
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains… ☆260 · Mar 7, 2026 · Updated 3 months ago
- Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models" ☆12 · Jul 25, 2024 · Updated last year
- WIKIGENBENCH: Exploring Full-length Wikipedia Generation under Real-World Scenario (COLING 2025) ☆13 · Jan 5, 2025 · Updated last year
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models ☆52 · Nov 17, 2024 · Updated last year
- awesome SAE papers ☆78 · May 24, 2025 · Updated last year
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs ☆15 · Jul 18, 2024 · Updated last year
- Single view shoe to 3D model using neural networks + Nvidia Kaolin ☆15 · Mar 24, 2022 · Updated 4 years ago
- ☆67 · Dec 3, 2024 · Updated last year
- A lightweight Inference Engine built for block diffusion models ☆46 · Apr 12, 2026 · Updated 2 months ago
- A curated list of LLM Interpretability related material: Tutorial, Library, Survey, Paper, Blog, etc. ☆307 · Jan 22, 2026 · Updated 4 months ago