Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network"
☆29 · Jun 4, 2024 · Updated 2 years ago
Alternatives and similar repositories for leela-interp
Users who are interested in leela-interp are comparing it to the libraries listed below.
- Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons ☆14 · Feb 13, 2023 · Updated 3 years ago
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024) ☆12 · Oct 31, 2024 · Updated last year
- ☆10 · Dec 4, 2024 · Updated last year
- A collection of different ways to implement accessing and modifying internal model activations for LLMs ☆24 · Oct 18, 2024 · Updated last year
- A Mechanistic Interpretability Analysis of Grokking ☆27 · Sep 26, 2022 · Updated 3 years ago
- ☆26 · Feb 20, 2026 · Updated 3 months ago
- Code for "What really matters in matrix-whitening optimizers?" ☆24 · Oct 31, 2025 · Updated 7 months ago
- A tiny easily hackable implementation of a feature dashboard. ☆16 · Oct 21, 2025 · Updated 7 months ago
- Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State ☆21 · Oct 24, 2025 · Updated 7 months ago
- ☆17 · Feb 14, 2024 · Updated 2 years ago
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind) ☆20 · Jan 19, 2025 · Updated last year
- Code for reproducing the paper "Neural Networks Fail to Learn Periodic Functions and How to Fix It" as part of the ML Reproducibility Cha… ☆11 · Apr 16, 2021 · Updated 5 years ago
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation ☆12 · May 24, 2021 · Updated 5 years ago
- Applying SAEs for fine-grained control ☆27 · Dec 15, 2024 · Updated last year
- Minimal but scalable implementation of large language models in JAX ☆34 · Nov 28, 2025 · Updated 6 months ago
- Official Implementation for "In-Context Reinforcement Learning from Noise Distillation" ☆35 · Sep 18, 2024 · Updated last year
- graphpatch is a library for activation patching on PyTorch neural network models. ☆21 · Feb 11, 2025 · Updated last year
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes. ☆22 · Feb 19, 2026 · Updated 3 months ago
- Master thesis: Exploring bias in German NLG (GPT-3 & GerPT-2). Applies regard classification and bias mitigation triggers. ☆16 · Sep 25, 2024 · Updated last year
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing… ☆10 · Oct 7, 2024 · Updated last year
- Multi-agent simulator in Jax for research and teaching in AI & ALife ☆31 · Apr 11, 2026 · Updated 2 months ago
- A tool for visualization of complex job searches. ☆13 · Jul 8, 2022 · Updated 3 years ago
- ☆27 · Oct 6, 2024 · Updated last year
- The "CoT-ICL Lab" framework for meta-training transformers ☆11 · Jun 3, 2026 · Updated last week
- Improving Steering Vectors by Targeting Sparse Autoencoder Features ☆27 · Nov 20, 2024 · Updated last year
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning". ☆68 · Aug 15, 2025 · Updated 10 months ago
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce… ☆25 · Feb 16, 2026 · Updated 3 months ago
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively. ☆73 · Apr 15, 2026 · Updated 2 months ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM) ☆43 · Aug 22, 2023 · Updated 2 years ago
- Vintix: Action Model via In-Context Reinforcement Learning (ICML 2025) ☆52 · May 23, 2025 · Updated last year
- The EMP Jammer is a jamming device that jams nearby devices by inducing an alternating voltage in them. ☆13 · Jan 3, 2023 · Updated 3 years ago
- ☆25 · Apr 23, 2024 · Updated 2 years ago
- Real-time latent exploration of diffusion models ☆29 · Apr 21, 2024 · Updated 2 years ago
- ☆36 · Apr 14, 2025 · Updated last year
- Concept Learning Dynamics ☆16 · Oct 29, 2024 · Updated last year
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc... ☆16 · May 28, 2025 · Updated last year
- ☆33 · Nov 30, 2025 · Updated 6 months ago
- GULAG: GUessing LAnGuages with neural networks ☆13 · May 4, 2022 · Updated 4 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism" ☆15 · Apr 30, 2025 · Updated last year