mattneary / attentionLinks

visualizing attention for LLM users

☆216

Alternatives and similar repositories for attention

Users that are interested in attention are comparing it to the libraries listed below

Sorting:

allenai / wimbd
What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets
☆221Updated 8 months ago
felipemaiapolo / tinyBenchmarks
Evaluating LLMs with fewer examples
☆160Updated last year
justinchiu / openlogprobs
Extract full next-token probabilities via language model APIs
☆246Updated last year
msclar / formatspread
Code accompanying "How I learned to start worrying about prompt formatting".
☆106Updated last month
kmeng01 / memit
Mass-editing thousands of facts into a transformer memory (ICLR 2023)
☆505Updated last year
lukasberglund / reversal_curse
☆291Updated last year
shengliu66 / ICV
Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering
☆181Updated 5 months ago
huggingface / cosmopedia
☆524Updated 7 months ago
davanstrien / awesome-synthetic-datasets
awesome synthetic (text) datasets
☆289Updated last week
callummcdougall / sae_vis
Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).
☆206Updated 7 months ago
dwzhu-pku / LongEmbed
LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)
☆138Updated 8 months ago
QingruZhang / PASTA
PASTA: Post-hoc Attention Steering for LLMs
☆121Updated 7 months ago
EleutherAI / concept-erasure
Erasing concepts from neural representations with provable guarantees
☆230Updated 5 months ago
da03 / Internalize_CoT_Step_by_Step
☆181Updated 2 months ago
eth-sri / language-model-arithmetic
Controlled Text Generation via Language Model Arithmetic
☆222Updated 10 months ago
Data-Provenance-Initiative / Data-Provenance-Collection
☆239Updated 3 months ago
princeton-nlp / AutoCompressors
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
☆307Updated 10 months ago
EleutherAI / delphi
Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …
☆193Updated this week
imoneoi / multipack_sampler
Multipack distributed sampler for fast padding-free training of LLMs
☆195Updated 11 months ago
FranxYao / Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
☆464Updated last year
Mihaiii / llm_steer
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…
☆240Updated 5 months ago
ericwtodd / function_vectors
Function Vectors in Large Language Models (ICLR 2024)
☆172Updated 3 months ago
collin-burns / discovering_latent_knowledge
☆273Updated last year
huggingface / datablations
Scaling Data-Constrained Language Models
☆338Updated 2 weeks ago
MadryLab / context-cite
Attribute (or cite) statements generated by LLMs back to in-context information.
☆245Updated 9 months ago
microsoft / rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
☆427Updated last year
Re-Align / URIAL
☆310Updated last year
allenai / WildBench
Benchmarking LLMs with Challenging Tasks from Real Users
☆229Updated 8 months ago
LeonGuertler / TextArena
A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning
☆210Updated this week
JinjieNi / MixEval
The official evaluation suite and dynamic data release for MixEval.
☆242Updated 8 months ago