Redwood Research's transformer interpretability tools
☆15Apr 15, 2022Updated 3 years ago
Alternatives and similar repositories for interp
Users that are interested in interp are comparing it to the libraries listed below
Sorting:
- Mechanistic Interpretability for Transformer Models☆53Jun 1, 2022Updated 3 years ago
- Code for reproducing the results from the paper Avoiding Side Effects in Complex Environments☆12Jun 3, 2021Updated 4 years ago
- Improving Steering Vectors by Targeting Sparse Autoencoder Features☆27Nov 20, 2024Updated last year
- A framework for implementing equivariant DL☆10May 25, 2021Updated 4 years ago
- Machine Learning for Alignment Bootcamp☆82Apr 27, 2022Updated 3 years ago
- Build a Docker container to build, train and deploy fast.ai based Deep Learning models with Amazon SageMaker☆13Dec 15, 2018Updated 7 years ago
- ☆11Jun 2, 2021Updated 4 years ago
- Code for Multi-scale Orderless Pooling of Deep Convolutional Activation Features☆13Mar 6, 2017Updated 8 years ago
- A benchmark for mechanistic discovery of circuits in Transformers☆16Dec 15, 2024Updated last year
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch☆16Sep 15, 2023Updated 2 years ago
- ☆27Nov 28, 2024Updated last year
- Trained models for keras-rl.☆21Sep 24, 2016Updated 9 years ago
- A collection of different ways to implement accessing and modifying internal model activations for LLMs☆20Oct 18, 2024Updated last year
- ☆57Jun 15, 2023Updated 2 years ago
- A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations☆205Dec 22, 2021Updated 4 years ago
- Understanding how features learned by neural networks evolve throughout training☆41Oct 24, 2024Updated last year
- A library for finding knowledge neurons in pretrained transformer models.☆159Feb 13, 2022Updated 4 years ago
- ☆35Jan 4, 2023Updated 3 years ago
- ☆11Feb 26, 2026Updated last week
- Topic modelling and co-occurrence analysis of the bio-economy☆10Jul 17, 2017Updated 8 years ago
- ☆13Jul 20, 2023Updated 2 years ago
- Starter kit and data loading code for the Trojan Detection Challenge NeurIPS 2022 competition☆33Jul 26, 2023Updated 2 years ago
- ☆10Aug 24, 2022Updated 3 years ago
- AEC Tech Hackathon - Embodied Carbon Tool☆10Oct 20, 2020Updated 5 years ago
- a Hadoop Map Reduce application that retrieves data/articles related to sports from sources like NY Times, Commoncrawl, and Twitter and c…☆13Oct 3, 2019Updated 6 years ago
- Background materials for the article "Productivity Assessment of Neural Code Completion"☆13Jul 11, 2023Updated 2 years ago
- Very Simple and Basic Implementation of Compositional Pattern Producing Network in TensorFlow☆11Nov 27, 2019Updated 6 years ago
- ☆13May 7, 2023Updated 2 years ago
- Virtual notebook that Evan uses for his PhD thesis.☆10Sep 5, 2025Updated 6 months ago
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago
- Helps create datasets scraped from Google Images☆12Oct 31, 2018Updated 7 years ago
- The AI that helps you achieve your goals☆11Feb 4, 2024Updated 2 years ago
- Summer Scheming!!!!!!☆11Aug 20, 2020Updated 5 years ago
- Source code for NeurIPS 2020 paper "Node Classification on Graphs with Few-Shot Novel Labels via Meta Transformed Network Embedding"☆10Nov 17, 2020Updated 5 years ago
- MTG deck importer for Table Top Simulator☆10May 7, 2017Updated 8 years ago
- ☆14Mar 15, 2025Updated 11 months ago
- ☆15Mar 13, 2025Updated 11 months ago
- CNN Image Retrieval Model Weights Ported☆12Jun 2, 2018Updated 7 years ago
- an optimizing curry compiler☆14Nov 27, 2022Updated 3 years ago