catherinesyeh / attention-viz
Visualizing query-key interactions in language + vision transformers (VIS 2023)
☆157 Updated last year
Alternatives and similar repositories for attention-viz
Users who are interested in attention-viz are comparing it to the libraries listed below.
- Emergent world representations: Exploring a sequence model trained on a synthetic task ☆197 Updated 2 years ago
- Extracting spatial and temporal world models from LLMs ☆257 Updated 2 years ago
- Website for hosting the Open Foundation Models Cheat Sheet. ☆269 Updated 8 months ago
- ☆69 Updated last year
- Tools for understanding how transformer predictions are built layer-by-layer ☆559 Updated 5 months ago
- ☆112 Updated 10 months ago
- ☆150 Updated 2 years ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper ☆132 Updated 3 years ago
- ☆150 Updated 4 months ago
- ☆105 Updated last year
- LLM-Merging: Building LLMs Efficiently through Merging ☆208 Updated last year
- Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google DeepMind ☆179 Updated last year
- PAIR.withgoogle.com and friends' work on interpretability methods ☆217 Updated last month
- ☆167 Updated 2 years ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al. (NeurIPS 2024) ☆198 Updated last year
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google DeepMind ☆72 Updated last year
- ☆95 Updated last year
- [NeurIPS 2023] Learning Transformer Programs ☆162 Updated last year
- Scaling Data-Constrained Language Models ☆343 Updated 6 months ago
- Evaluating LLMs with fewer examples ☆170 Updated last year
- ☆162 Updated last year
- ☆38 Updated last year
- ☆323 Updated last year
- Official implementation of FIND (NeurIPS '23) Function Interpretation Benchmark and Automated Interpretability Agents ☆52 Updated last year
- Recurrent Memory Transformer ☆154 Updated 2 years ago
- TART: A plug-and-play Transformer module for task-agnostic reasoning ☆202 Updated 2 years ago
- ☆301 Updated 2 years ago
- Erasing concepts from neural representations with provable guarantees ☆242 Updated 11 months ago
- Functional Benchmarks and the Reasoning Gap ☆90 Updated last year
- ☆261 Updated 9 months ago