☆36Jun 13, 2025Updated 8 months ago
Alternatives and similar repositories for awesome-LLM-neuron
Users that are interested in awesome-LLM-neuron are comparing it to the libraries listed below
Sorting:
- awesome SAE papers☆74May 24, 2025Updated 9 months ago
- awesome papers in LLM interpretability☆609Aug 20, 2025Updated 6 months ago
- [ACL 2024 Findings] "Understanding and Patching Compositional Reasoning in LLMs"☆13Aug 28, 2024Updated last year
- ☆17Nov 7, 2023Updated 2 years ago
- Code to enable layer-level steering in LLMs using sparse auto encoders☆31Sep 18, 2025Updated 5 months ago
- Must-read papers and blogs about parametric knowledge mechanism in LLMs.☆35May 9, 2025Updated 10 months ago
- FeatureAlignment = Alignment + Mechanistic Interpretability☆34Mar 8, 2025Updated last year
- ☆33Aug 5, 2023Updated 2 years ago
- 紫菜鱼的网络安全扫描器☆11Dec 19, 2023Updated 2 years ago
- Exploring the minimal architecture required for coherent English language generation.☆12Mar 5, 2025Updated last year
- [ICLR2024] "Backdoor Federated Learning by Poisoning Backdoor-Critical Layers"☆54Dec 11, 2024Updated last year
- A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..☆294Jan 22, 2026Updated last month
- Pytorch implementation of our paper accepted by ICML 2023 -- "Bi-directional Masks for Efficient N:M Sparse Training"☆13Jun 7, 2023Updated 2 years ago
- sealos deck☆11Mar 30, 2024Updated last year
- Unveiling and Mitigating Bias in Mental Health Analysis with Large Language Models☆12Jun 21, 2024Updated last year
- Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion☆11Apr 1, 2024Updated last year
- codes and plots for "Active-Dormant Attention Heads: Mechanistically Demystifying Extreme-Token Phenomena in LLMs"☆10Dec 30, 2024Updated last year
- Improving Symbolic Music Generation with Inference-Time Alignment☆20Aug 2, 2025Updated 7 months ago
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes.☆22Feb 19, 2026Updated 2 weeks ago
- This is the source code of our paper PALT in EMNLP2022.☆13Nov 19, 2022Updated 3 years ago
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- This is a repository for RM2021 Software tutorial☆11Nov 4, 2020Updated 5 years ago
- 在线登录注册(android客户端+javaweb服务端+腾讯云服务器+腾讯云数据库)☆10Nov 11, 2020Updated 5 years ago
- Reproduction Code for Paper "Investigating Multi-Hop Factual Shortcuts in Knowledge Editing of Large Language Models"☆13Jun 1, 2024Updated last year
- ☆13Jan 1, 2018Updated 8 years ago
- A project designed to build and render a full Minecraft crafting tree.☆10Aug 10, 2021Updated 4 years ago
- Tutorials for MATH 4432 Statistical Machine Learning, HKUST, Fall 2022☆11Sep 17, 2024Updated last year
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- An awesome repository & A comprehensive survey on interpretability of LLM attention heads.☆400Mar 2, 2025Updated last year
- ☆65Jun 1, 2025Updated 9 months ago
- Must-read Papers on Knowledge Editing for Large Language Models.☆1,220Jul 12, 2025Updated 7 months ago
- ☆15Jun 4, 2024Updated last year
- ☆14Apr 18, 2025Updated 10 months ago
- Confidence Regulation Neurons in Language Models (NeurIPS 2024)☆15Feb 1, 2025Updated last year
- Aligning Agentic World Models via Knowledgeable Experience Learning☆31Jan 25, 2026Updated last month
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆23Jul 1, 2025Updated 8 months ago
- CVE-2022-22978 Spring-Security bypass Demo☆16Jun 2, 2022Updated 3 years ago
- 通过分离的方式免杀火绒☆12Dec 15, 2023Updated 2 years ago
- ☆10May 31, 2021Updated 4 years ago