☆23Jun 30, 2025Updated 8 months ago
Alternatives and similar repositories for MIB-circuit-track
Users that are interested in MIB-circuit-track are comparing it to the libraries listed below
- ☆32Feb 15, 2026Updated 3 weeks ago
- ☆73Jul 24, 2025Updated 7 months ago
- PyTorch and NNsight implementation of AtP* (Kramar et al., 2024, DeepMind)☆20Jan 19, 2025Updated last year
- ☆17Aug 30, 2025Updated 6 months ago
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- A benchmark for mechanistic discovery of circuits in Transformers☆16Dec 15, 2024Updated last year
- Code for my NeurIPS 2024 ATTRIB paper titled "Attribution Patching Outperforms Automated Circuit Discovery"☆47May 31, 2024Updated last year
- Engine for collecting, uploading, and downloading model activations☆26Apr 2, 2025Updated 11 months ago
- A library for mechanistic anomaly detection☆22Jan 9, 2025Updated last year
- A library for efficient patching and automatic circuit discovery.☆91Dec 31, 2025Updated 2 months ago
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- ☆20Apr 10, 2025Updated 10 months ago
- graphpatch is a library for activation patching on PyTorch neural network models.☆21Feb 11, 2025Updated last year
- Evaluation code and data for "Automatic Correction of Human Translations" [NAACL 2022].☆19Dec 9, 2022Updated 3 years ago
- ☆25Feb 20, 2026Updated 2 weeks ago
- [NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers☆21May 16, 2023Updated 2 years ago
- Multi-Layer Sparse Autoencoders (ICLR 2025)☆29Feb 6, 2026Updated last month
- Function Vectors in Large Language Models (ICLR 2024)☆192Apr 17, 2025Updated 10 months ago
- ☆13Oct 5, 2025Updated 5 months ago
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"☆31Apr 28, 2023Updated 2 years ago
- Attribution-based Parameter Decomposition☆34Jun 11, 2025Updated 8 months ago
- ☆27Jun 12, 2023Updated 2 years ago
- ☆84Feb 25, 2025Updated last year
- Implementation of PCA algorithm using Gram-Schmidt modification on NIPALS☆10Jun 13, 2015Updated 10 years ago
- Radiocarbon calibration command line tool and Haskell module☆11Nov 24, 2025Updated 3 months ago
- 🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models☆12May 30, 2025Updated 9 months ago
- ☆52Oct 23, 2023Updated 2 years ago
- Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants.☆199Updated this week
- A framework for evaluating Machine Translation models.☆12May 26, 2025Updated 9 months ago
- Spark projects. Learning book "Machine Learning with Spark"☆10Jun 3, 2017Updated 8 years ago
- Implementation of Implicit Reparameterization Trick☆11Dec 2, 2024Updated last year
- ☆14Apr 29, 2025Updated 10 months ago
- Simple clock/cron process that monitors a specific directory and runs jobs based on its filename.☆10Jun 8, 2020Updated 5 years ago
- Haskell implementation of HyperLogLog++ & MinHash for efficient cardinality and intersection estimation☆12Aug 1, 2016Updated 9 years ago
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers☆43Feb 12, 2025Updated last year
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- Simple testing toolkit☆10May 28, 2021Updated 4 years ago
- An interactive AI character with voice input, voice output, and profile image generation—all running locally with Nexa SDK and powered by…☆11Oct 7, 2024Updated last year
- ☆11May 6, 2016Updated 9 years ago