catherinesyeh / attention-viz

Visualizing query-key interactions in language + vision transformers

☆149

Alternatives and similar repositories for attention-viz

Users who are interested in attention-viz are comparing it to the libraries listed below.

wesg52 / world-models
Extracting spatial and temporal world models from LLMs
☆256 · Updated last year
HazyResearch / TART
TART: A plug-and-play Transformer module for task-agnostic reasoning
☆200 · Updated 2 years ago
KihoPark / LLM_Categorical_Hierarchical_Representations
☆101 · Updated 5 months ago
likenneth / othello_world
Emergent world representations: Exploring a sequence model trained on a synthetic task
☆182 · Updated 2 years ago
allenai / fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
☆267 · Updated 2 months ago
KaiNylund / lm-weights-encode-time
☆68 · Updated 10 months ago
EleutherAI / concept-erasure
Erasing concepts from neural representations with provable guarantees
☆230 · Updated 5 months ago
google-deepmind / mishax
☆134 · Updated 3 months ago
anthropics / toy-models-of-superposition
Notebooks accompanying Anthropic's "Toy Models of Superposition" paper
☆127 · Updated 2 years ago
mcleish7 / arithmetic
Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)
☆190 · Updated last year
wolfecameron / lora_instruction_tune
☆39 · Updated last year
allenai / CommonGen-Eval
Evaluating LLMs with CommonGen-Lite
☆90 · Updated last year
srush / do-we-need-attention
☆166 · Updated 2 years ago
ConsequentAI / fneval
Functional Benchmarks and the Reasoning Gap
☆88 · Updated 9 months ago
SALT-NLP / demonstrated-feedback
☆124 · Updated 9 months ago
joshuacnf / Ctrl-G
☆86 · Updated 6 months ago
neulab / gemini-benchmark
☆150 · Updated last year
microsoft / mechanistic-error-probe
A mechanistic approach for understanding and detecting factual errors of large language models.
☆46 · Updated last year
lucidrains / mirasol-pytorch
Implementation of 🌻 Mirasol, SOTA Multimodal Autoregressive model out of Google Deepmind, in Pytorch
☆89 · Updated last year
llm-merging / LLM-Merging
LLM-Merging: Building LLMs Efficiently through Merging
☆201 · Updated 9 months ago
sdascoli / boolformer
☆163 · Updated last year
lucidrains / CALM-pytorch
Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind
☆177 · Updated 10 months ago
taufeeque9 / codebook-features
Sparse and discrete interpretability tool for neural networks
☆63 · Updated last year
snap-stanford / MLAgentBench
☆297 · Updated last year
JoshEngels / MultiDimensionalFeatures
Code for reproducing our paper "Not All Language Model Features Are Linear"
☆77 · Updated 7 months ago
allenai / discoverybench
Discovering Data-driven Hypotheses in the Wild
☆99 · Updated last month
sileod / tasksource
Datasets collection and preprocessings framework for NLP extreme multitask learning
☆184 · Updated this week
callummcdougall / sae_vis
Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).
☆206 · Updated 6 months ago
normster / llm_rules
RuLES: a benchmark for evaluating rule-following in language models
☆227 · Updated 4 months ago
EleutherAI / delphi
Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …
☆192 · Updated this week