google-deepmind / tracrLinks

☆540

Alternatives and similar repositories for tracr

Users that are interested in tracr are comparing it to the libraries listed below

Sorting:

tech-srl / RASP
An interpreter for RASP as described in the ICML 2021 paper "Thinking Like Transformers"
☆317Updated 10 months ago
srush / raspy
An interactive exploration of Transformer programming.
☆266Updated last year
AlignmentResearch / tuned-lens
Tools for understanding how transformer predictions are built layer-by-layer
☆507Updated last year
HazyResearch / H3
Language Modeling with the H3 State Space Model
☆519Updated last year
likenneth / othello_world
Emergent world representations: Exploring a sequence model trained on a synthetic task
☆184Updated 2 years ago
google-deepmind / neural_networks_chomsky_hierarchy
Neural Networks and the Chomsky Hierarchy
☆207Updated last year
collin-burns / discovering_latent_knowledge
☆274Updated last year
r-three / git-theta
git extension for {collaborative, communal, continual} model development
☆215Updated 8 months ago
TransformerLensOrg / CircuitsVis
Mechanistic Interpretability Visualizations using React
☆265Updated 7 months ago
EleutherAI / elk
Keeping language models honest by directly eliciting knowledge encoded in their activations.
☆207Updated this week
google-research / meliad
☆256Updated last month
michaelhodel / re-arc
Reverse Engineering the Abstraction and Reasoning Corpus
☆289Updated 5 months ago
stanford-crfm / levanter
Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax
☆625Updated this week
jxbz / agd
Automatic gradient descent
☆208Updated 2 years ago
srush / Transformer-Puzzles
Puzzles for exploring transformers
☆355Updated 2 years ago
lucidrains / memorizing-transformers-pytorch
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate …
☆634Updated 2 years ago
EleutherAI / concept-erasure
Erasing concepts from neural representations with provable guarantees
☆231Updated 6 months ago
callummcdougall / ARENA_2.0
Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.
☆217Updated last year
tysam-code / hlb-gpt
Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…
☆349Updated 11 months ago
HazyResearch / safari
Convolutions for Sequence Modeling
☆895Updated last year
google-research / cascades
Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference…
☆207Updated last month
inverse-scaling / prize
A prize for finding tasks that cause large language models to show inverse scaling
☆613Updated last year
justinchiu / openlogprobs
Extract full next-token probabilities via language model APIs
☆247Updated last year
sanjeevanahilan / nanoChatGPT
A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick
☆290Updated last year
anthropics / toy-models-of-superposition
Notebooks accompanying Anthropic's "Toy Models of Superposition" paper
☆127Updated 2 years ago
samuela / git-re-basin
Code release for "Git Re-Basin: Merging Models modulo Permutation Symmetries"
☆486Updated 2 years ago
anthropics / PySvelte
A library for bridging Python and HTML/Javascript (via Svelte) for creating interactive visualizations
☆194Updated 3 years ago
JonasGeiping / cramming
Cramming the training of a (BERT-type) language model into limited compute.
☆1,339Updated last year
callummcdougall / sae_vis
Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).
☆206Updated 7 months ago
neelnanda-io / 1L-Sparse-Autoencoder
☆123Updated last year