mattneary / attentionLinks
visualizing attention for LLM users
☆237Updated last year
Alternatives and similar repositories for attention
Users that are interested in attention are comparing it to the libraries listed below
Sorting:
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆226Updated last year
- Extract full next-token probabilities via language model APIs☆248Updated last year
- ☆300Updated 2 years ago
- Mass-editing thousands of facts into a transformer memory (ICLR 2023)☆535Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".☆113Updated 7 months ago
- Evaluating LLMs with fewer examples☆169Updated last year
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆196Updated 11 months ago
- Controlled Text Generation via Language Model Arithmetic☆224Updated last year
- ☆162Updated last year
- ☆557Updated last year
- A toolkit for describing model features and intervening on those features to steer behavior.☆225Updated last month
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆222Updated last month
- awesome synthetic (text) datasets☆321Updated last week
- A package to generate summaries of long-form text and evaluate the coherence of these summaries. Official package for our ICLR 2024 paper…☆128Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters☆278Updated last year
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆237Updated last year
- Steering vectors for transformer language models in Pytorch / Huggingface☆137Updated 10 months ago
- Fast & more realistic evaluation of chat language models. Includes leaderboard.☆189Updated 2 years ago
- code for training & evaluating Contextual Document Embedding models☆202Updated 8 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆102Updated last year
- Code and data for "Lost in the Middle: How Language Models Use Long Contexts"☆373Updated 2 years ago
- ☆283Updated last year
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆145Updated last year
- Functional Benchmarks and the Reasoning Gap☆89Updated last year
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆236Updated this week
- Tools for understanding how transformer predictions are built layer-by-layer☆560Updated 5 months ago
- [EMNLP 2023] Adapting Language Models to Compress Long Contexts☆324Updated last year
- Attribute (or cite) statements generated by LLMs back to in-context information.☆313Updated last year
- ☆261Updated 9 months ago
- The official evaluation suite and dynamic data release for MixEval.☆253Updated last year