mechanistic-interpretability-grokking / progress-measures-paperView external linksLinks
☆78Oct 11, 2022Updated 3 years ago
Alternatives and similar repositories for progress-measures-paper
Users that are interested in progress-measures-paper are comparing it to the libraries listed below
Sorting:
- Omnigrok: Grokking Beyond Algorithmic Data☆62Feb 24, 2023Updated 2 years ago
- Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks"☆18Nov 24, 2023Updated 2 years ago
- ☆11Feb 11, 2020Updated 6 years ago
- Deep Networks Grok All the Time and Here is Why☆38May 18, 2024Updated last year
- PyTorch implementation of StableMask (ICML'24)☆15Jun 27, 2024Updated last year
- ☆15Feb 28, 2025Updated 11 months ago
- Tools for understanding how transformer predictions are built layer-by-layer☆567Aug 7, 2025Updated 6 months ago
- "FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)☆15Jul 24, 2023Updated 2 years ago
- Official resporitory for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".☆20Feb 23, 2024Updated last year
- (Model-written) LLM evals library☆18Dec 13, 2024Updated last year
- Xmixers: A collection of SOTA efficient token/channel mixers☆28Sep 4, 2025Updated 5 months ago
- Codebase for "On Targeted Manipulation and Deception when Optimizing LLMs for User Feedback". This repo implements a generative multi-tur…☆23Dec 3, 2024Updated last year
- A library for mechanistic interpretability of GPT-style language models☆3,073Updated this week
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆60Feb 7, 2025Updated last year
- ☆198Nov 17, 2024Updated last year
- [CHIL 2024] Interpretation of Intracardiac Electrograms Through Textual Representations☆12Sep 4, 2024Updated last year
- ENet for 2D semantic segmentation in ScanNet☆26Feb 11, 2019Updated 7 years ago
- Official code for UnICORNN (ICML 2021)☆28Oct 1, 2021Updated 4 years ago
- Sparse probing paper full code.☆66Dec 17, 2023Updated 2 years ago
- ☆207Oct 14, 2025Updated 4 months ago
- Attribution-based Parameter Decomposition☆33Jun 11, 2025Updated 8 months ago
- ☆35Apr 12, 2024Updated last year
- Python syntax generator based on Object-Oriented Programing, type hints, and simplicity☆10Sep 26, 2021Updated 4 years ago
- A visual interface for understanding and interpreting Transformers☆77Oct 21, 2023Updated 2 years ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆135Sep 14, 2022Updated 3 years ago
- The state-of-art deep rl algorithms for Montezuma's revenge☆28Oct 28, 2018Updated 7 years ago
- ☆29Apr 4, 2024Updated last year
- Deeply supervised density regression for automatic cell counting in microscopy images☆12Jan 31, 2022Updated 4 years ago
- A Rat fMRI standardized protocol☆14Jul 25, 2024Updated last year
- Official repository for ACM Multimedia'23 paper "MATK: The Meme Analytical Tool Kit"☆13May 29, 2024Updated last year
- Sparsey, trademark Neurithmic Systems, is unsupervised learning algorithm inspired from the computations of cortical macro-columns and mi…☆12Feb 27, 2023Updated 2 years ago
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆85Mar 7, 2025Updated 11 months ago
- Implementation of various label fusion approaches for medical imaging.☆14May 29, 2025Updated 8 months ago
- ORB_SLAM2_error_analysis☆11Aug 4, 2017Updated 8 years ago
- ☆10Mar 5, 2025Updated 11 months ago
- Official code for "Blind Image Deblurring Based on Dual Attention Network and 2D Blur Kernel Estimation" (ICIP 2021)☆13Nov 11, 2025Updated 3 months ago
- 51单片机超轻量级实时操作系统,适合在8051为内核的MCU上运行☆11Jan 21, 2023Updated 3 years ago
- Methods 2: The General Linear Model☆15May 5, 2022Updated 3 years ago
- Bio-inspired neuromorphic cerebellum☆10Sep 29, 2023Updated 2 years ago