mattneary / attentionLinks
visualizing attention for LLM users
☆232Updated 11 months ago
Alternatives and similar repositories for attention
Users that are interested in attention are comparing it to the libraries listed below
Sorting:
- Evaluating LLMs with fewer examples☆167Updated last year
- Extract full next-token probabilities via language model APIs☆247Updated last year
- Controlled Text Generation via Language Model Arithmetic☆223Updated last year
- ☆297Updated last year
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆223Updated 11 months ago
- Mass-editing thousands of facts into a transformer memory (ICLR 2023)☆525Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".☆110Updated 5 months ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering☆192Updated 9 months ago
- awesome synthetic (text) datasets☆305Updated 4 months ago
- Multipack distributed sampler for fast padding-free training of LLMs☆201Updated last year
- Fast & more realistic evaluation of chat language models. Includes leaderboard.☆189Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆247Updated 8 months ago
- ☆552Updated 11 months ago
- Scaling Data-Constrained Language Models☆342Updated 4 months ago
- code for training & evaluating Contextual Document Embedding models☆200Updated 6 months ago
- DSIR large-scale data selection framework for language model training☆265Updated last year
- ☆149Updated last year
- PASTA: Post-hoc Attention Steering for LLMs☆127Updated 11 months ago
- ☆256Updated 7 months ago
- BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.☆215Updated 2 months ago
- Functional Benchmarks and the Reasoning Gap☆89Updated last year
- Code for the paper "Fishing for Magikarp"☆172Updated 6 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆276Updated last year
- Inference-Time Intervention: Eliciting Truthful Answers from a Language Model☆554Updated 9 months ago
- ☆156Updated last year
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets☆217Updated last year
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆144Updated last year
- Attribute (or cite) statements generated by LLMs back to in-context information.☆297Updated last year
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆477Updated last year
- ☆197Updated 6 months ago