mechanistic-interpretability-grokking/progress-measures-paper

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mechanistic-interpretability-grokking/progress-measures-paper)

mechanistic-interpretability-grokking / progress-measures-paper

☆92

Alternatives and similar repositories for progress-measures-paper

Users that are interested in progress-measures-paper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

psoulos / role-decomposition
View on GitHub
☆11Feb 11, 2020Updated 6 years ago
apple / ml-tic-lm
View on GitHub
Repository for the paper: "TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining" ACL Oral 2025
☆24Apr 19, 2026Updated 3 months ago
AlignmentResearch / tuned-lens
View on GitHub
Tools for understanding how transformer predictions are built layer-by-layer
☆605Aug 7, 2025Updated 11 months ago
fjzzq2002 / pizza
View on GitHub
Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks"
☆20Nov 24, 2023Updated 2 years ago
INK-USC / FiD-ICL
View on GitHub
"FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning" (ACL 2023)
☆15Jul 24, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
AhmedImtiazPrio / grok-adversarial
View on GitHub
Deep Networks Grok All the Time and Here is Why
☆40Apr 20, 2026Updated 3 months ago
LRudL / evalugator
View on GitHub
(Model-written) LLM evals library
☆19Dec 13, 2024Updated last year
amitlevy / evolutionaryGPT
View on GitHub
Evolutionary Search for expert-level performance on any task with environmental feedback
☆14Oct 12, 2025Updated 9 months ago
noemaresearch / pinboard
View on GitHub
Pin files for contextual, codebase-level AI assistance.
☆16Jul 11, 2024Updated 2 years ago
r-three / mats
View on GitHub
☆33Jul 8, 2024Updated 2 years ago
eboix / high_precision_barycenters
View on GitHub
Compute free-support Wasserstein barycenters exactly
☆10Aug 22, 2024Updated last year
d-doshi / Grokking
View on GitHub
☆19Feb 28, 2025Updated last year
nkandpa2 / long_tail_knowledge
View on GitHub
Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"
☆77Apr 12, 2023Updated 3 years ago
chmathys-teaching-f22 / methods-2-course
View on GitHub
Methods 2: The General Linear Model
☆15May 5, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
tk-rusch / unicornn
View on GitHub
Official code for UnICORNN (ICML 2021)
☆28Oct 1, 2021Updated 4 years ago
jacobdunefsky / transcoder_circuits
View on GitHub
☆212Nov 17, 2024Updated last year
rrmenon10 / ExEnt
View on GitHub
[ACL 2022] CLUES: A Benchmark for Learning Classifiers using Natural Language Explanations
☆10Jun 5, 2022Updated 4 years ago
mathiasisaksen / GRFics
View on GitHub
An R library for efficient generation of noise functions from approximate Gaussian random fields
☆12May 27, 2021Updated 5 years ago
Doraemonzzz / xmixers
View on GitHub
Xmixers: A collection of SOTA efficient token/channel mixers
☆28Sep 4, 2025Updated 10 months ago
jscad / scad-api
View on GitHub
DEPRECATED: OpenSCAD like API for JSCAD (See the link below)
☆14Jan 17, 2018Updated 8 years ago
phueb / Zorro
View on GitHub
Grammar test suite for masked language models
☆10Jan 1, 2023Updated 3 years ago
MikaStars39 / StableMask
View on GitHub
PyTorch implementation of StableMask (ICML'24)
☆15Jun 27, 2024Updated 2 years ago
john-hewitt / dyckkm-learning
View on GitHub
Codebase implementing LMs for learning the Dyck-(k,m) bounded hierarchical language
☆16Oct 11, 2020Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
HoagyC / sparse_coding
View on GitHub
Using sparse coding to find distributed representations used by neural networks.
☆306Nov 10, 2023Updated 2 years ago
INK-USC / PE2
View on GitHub
Code for paper "Prompt Engineering a Prompt Engineer" (https://arxiv.org/abs/2311.05661)
☆12Aug 1, 2024Updated last year
yixinwang / representation-causal-public
View on GitHub
☆46Feb 16, 2022Updated 4 years ago
alexrame / diwa
View on GitHub
DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization
☆31Jan 31, 2023Updated 3 years ago
saprmarks / feature-circuits
View on GitHub
☆223Oct 14, 2025Updated 9 months ago
xingyizhao / PURE
View on GitHub
Code associated with ICML (2024). "Defense against Backdoor Attack on Pre-trained Language Models via Head Pruning and Attention Normaliz…
☆11Feb 22, 2026Updated 5 months ago
RobertCsordas / transformer_generalization
View on GitHub
The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s…
☆66Dec 16, 2022Updated 3 years ago
fbarez / neuroplasticity
View on GitHub
☆14Mar 31, 2024Updated 2 years ago
ApolloResearch / apd
View on GitHub
Attribution-based Parameter Decomposition
☆35Jun 11, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
8enmann / webtendo
View on GitHub
Multiplayer JS game platform
☆16Oct 16, 2017Updated 8 years ago
ZiyueWang25 / llm-security-challenge
View on GitHub
Can Large Language Models Solve Security Challenges? We test LLMs' ability to interact and break out of shell environments using the Over…
☆13Aug 21, 2023Updated 2 years ago
zhuxinqimac / stylegan2vp
View on GitHub
☆11Jul 28, 2021Updated 4 years ago
explanare / ravel
View on GitHub
Evaluate interpretability methods on localizing and disentangling concepts in LLMs.
☆58Oct 30, 2025Updated 8 months ago
AliMorty / Text-Classification
View on GitHub
In this project, we used 3 different metrics (Information Gain, Mutual Information, Chi Squared) to find important words and then we used…
☆11Aug 7, 2018Updated 7 years ago
necludov / action-matching
View on GitHub
Implementation of Action Matching for the Schrödinger equation
☆25Jun 18, 2023Updated 3 years ago
zeyuyun1 / TransformerVis
View on GitHub
☆43Nov 16, 2021Updated 4 years ago