mattneary / attentionLinks
visualizing attention for LLM users
☆216Updated 7 months ago
Alternatives and similar repositories for attention
Users that are interested in attention are comparing it to the libraries listed below
Sorting:
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆221Updated 8 months ago
- Evaluating LLMs with fewer examples☆160Updated last year
- Extract full next-token probabilities via language model APIs☆246Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".☆106Updated last month
- Mass-editing thousands of facts into a transformer memory (ICLR 2023)☆505Updated last year
- ☆291Updated last year
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆181Updated 5 months ago
- ☆524Updated 7 months ago
- awesome synthetic (text) datasets☆289Updated last week
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆206Updated 7 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆138Updated 8 months ago
- PASTA: Post-hoc Attention Steering for LLMs☆121Updated 7 months ago
- Erasing concepts from neural representations with provable guarantees☆230Updated 5 months ago
- ☆181Updated 2 months ago
- Controlled Text Generation via Language Model Arithmetic☆222Updated 10 months ago
- ☆239Updated 3 months ago
- [EMNLP 2023] Adapting Language Models to Compress Long Contexts☆307Updated 10 months ago
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆193Updated this week
- Multipack distributed sampler for fast padding-free training of LLMs☆195Updated 11 months ago
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆464Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆240Updated 5 months ago
- Function Vectors in Large Language Models (ICLR 2024)☆172Updated 3 months ago
- ☆273Updated last year
- Scaling Data-Constrained Language Models☆338Updated 2 weeks ago
- Attribute (or cite) statements generated by LLMs back to in-context information.☆245Updated 9 months ago
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.☆427Updated last year
- ☆310Updated last year
- Benchmarking LLMs with Challenging Tasks from Real Users☆229Updated 8 months ago
- A Collection of Competitive Text-Based Games for Language Model Evaluation and Reinforcement Learning☆210Updated this week
- The official evaluation suite and dynamic data release for MixEval.☆242Updated 8 months ago