Sparse and discrete interpretability tool for neural networks
☆64Feb 12, 2024Updated 2 years ago
Alternatives and similar repositories for codebook-features
Users that are interested in codebook-features are comparing it to the libraries listed below
Sorting:
- Fluent dreaming for language models☆13Jul 22, 2024Updated last year
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning☆21Jul 9, 2023Updated 2 years ago
- Code for T-MARS data filtering☆35Aug 23, 2023Updated 2 years ago
- ☆15Feb 21, 2024Updated 2 years ago
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."☆18Dec 13, 2024Updated last year
- ☆37Dec 19, 2024Updated last year
- Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.☆23Nov 23, 2022Updated 3 years ago
- This is the official implementation for our ACL 2024 paper: "Causal Estimation of Memorisation Profiles".☆24Mar 25, 2025Updated 11 months ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆245Dec 16, 2024Updated last year
- [CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model☆24Oct 12, 2024Updated last year
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Oct 19, 2024Updated last year
- Official implementation of CytoSAE: Interpretable Cell Embeddings for Hematology☆22Jul 17, 2025Updated 7 months ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆35Oct 16, 2025Updated 4 months ago
- Repo for the paper "Bounding Training Data Reconstruction in Private (Deep) Learning".☆11Jun 16, 2023Updated 2 years ago
- Transformer + GAT for RNA chemical reactivity prediction| Stanford Ribonanza☆11Jan 28, 2026Updated last month
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- A library for efficient patching and automatic circuit discovery.☆90Dec 31, 2025Updated 2 months ago
- ☆26Nov 23, 2023Updated 2 years ago
- ☆209Oct 14, 2025Updated 4 months ago
- ☆118Feb 11, 2025Updated last year
- Multi-encoder segmentation for contrail detection in satellite imagery | Google Researc☆11Jan 28, 2026Updated last month
- ☆12Sep 26, 2019Updated 6 years ago
- Implementation for ACProp ( Momentum centering and asynchronous update for adaptive gradient methdos, NeurIPS 2021)☆16Oct 11, 2021Updated 4 years ago
- ☆10Oct 28, 2024Updated last year
- Sample, estimate, aggregate: A recipe for causal discovery foundation models☆17Jun 21, 2024Updated last year
- ☆14Oct 17, 2023Updated 2 years ago
- ☆12Jul 12, 2024Updated last year
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated last year
- Fine-tuning Quantized Neural Networks with Zeroth-order Optimization☆16Sep 17, 2025Updated 5 months ago
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better☆16Feb 15, 2025Updated last year
- Post-processing for fair classification☆16Jun 30, 2025Updated 8 months ago
- Sparse Autoencoder Training Library☆55May 1, 2025Updated 9 months ago
- Code for "Theoretical Foundations of Deep Selective State-Space Models" (NeurIPS 2024)☆15Jan 7, 2025Updated last year
- A tiny easily hackable implementation of a feature dashboard.☆15Oct 21, 2025Updated 4 months ago
- ☆13Feb 25, 2025Updated last year
- Code for ProTrix: Building Models for Planning and Reasoning over Tables with Sentence Context☆18Nov 15, 2024Updated last year
- 2nd Place Solution for the Google Research - Identify Contrails to Reduce Global Warming Competition☆14Aug 15, 2023Updated 2 years ago
- Adversaial attack comparative assessment Large Language Model☆13May 21, 2025Updated 9 months ago
- ☆29Apr 30, 2024Updated last year