adamimos / epsilon-transformers

epsilon machines and transformers!

☆33

Alternatives and similar repositories for epsilon-transformers

Users who are interested in epsilon-transformers are comparing it to the repositories listed below.

timaeus-research / devinterp
Tools for studying developmental interpretability in neural networks.
☆114 Updated 4 months ago
anthropics / toy-models-of-superposition
Notebooks accompanying Anthropic's "Toy Models of Superposition" paper
☆130 Updated 3 years ago
genlm / llamppl
Probabilistic programming with large language models
☆144 Updated this week
ApolloResearch / apd
Attribution-based Parameter Decomposition
☆31 Updated 5 months ago
neoneye / arc-notes
My writings about ARC (Abstraction and Reasoning Corpus)
☆85 Updated 2 weeks ago
neoneye / ARC-Interactive-History-Dataset
The history files when recording human interaction while solving ARC tasks
☆118 Updated last week
bilal-chughtai / rep-theory-mech-interp
☆27 Updated 2 years ago
apartresearch / interpretability-starter
🧠 Starter templates for doing interpretability research
☆75 Updated 2 years ago
goodfire-ai / spd
Stochastic Parameter Decomposition
☆51 Updated last week
JasonGross / guarantees-based-mechanistic-interpretability
☆17 Updated last week
thestephencasper / everything-you-need
we got you bro
☆36 Updated last year
google-deepmind / mishax
☆143 Updated 2 months ago
jbloomAus / SAEDashboard
☆79 Updated last month
goodfire-ai / scribe
☆53 Updated last month
KhoomeiK / complexity-scaling
gzip Predicts Data-dependent Scaling Laws
☆34 Updated last year
mechanistic-interpretability-grokking / progress-measures-paper
☆70 Updated 3 years ago
callummcdougall / sae_vis
Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).
☆227 Updated 11 months ago
TransformerLensOrg / CircuitsVis
Mechanistic Interpretability Visualizations using React
☆301 Updated 11 months ago
google-deepmind / neural_networks_solomonoff_induction
Learning Universal Predictors
☆81 Updated last year
neelnanda-io / Grokking
A Mechanistic Interpretability Analysis of Grokking
☆23 Updated 3 years ago
KindXiaoming / BIMT
Brain-Inspired Modular Training (BIMT), a method for making neural networks more modular and interpretable.
☆173 Updated 2 years ago
KihoPark / LLM_Categorical_Hierarchical_Representations
☆111 Updated 9 months ago
ApolloResearch / sample
Repository with sample code using Apollo's suggested engineering practices
☆13 Updated 11 months ago
victorvikram / ConceptARC
Materials for ConceptARC paper
☆106 Updated last year
goodfire-ai / r1-interpretability
Open source interpretability artefacts for R1.
☆163 Updated 7 months ago
akarshkumar0101 / fer
Code for the Fractured Entangled Representation Hypothesis position paper!
☆206 Updated 2 weeks ago
clement-bonnet / lpn
Latent Program Network (from the "Searching Latent Program Spaces" paper)
☆105 Updated last month
METR / vivaria
Vivaria is METR's tool for running evaluations and conducting agent elicitation research.
☆120 Updated last week
probcomp / LLaMPPL
A domain-specific probabilistic programming language for modeling and inference with language models
☆137 Updated 6 months ago
callummcdougall / sae_visualizer
☆29 Updated last year