wln20/Attention-Viewer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/wln20/Attention-Viewer)

wln20 / Attention-Viewer

A plug-and-play tool for visualizing attention-score heatmap in generative LLMs. Easy to customize for your own need.

☆52

Alternatives and similar repositories for Attention-Viewer

Users that are interested in Attention-Viewer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

wln20 / CSKV
View on GitHub
[NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios
☆16Oct 18, 2024Updated last year
raymin0223 / fast_robust_early_exit
View on GitHub
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)
☆67Sep 28, 2024Updated last year
XMUDeepLIT / QGC
View on GitHub
Code for "Retaining Key Information under High Compression Rates: Query-Guided Compressor for LLMs" (ACL 2024)
☆20Jun 12, 2024Updated 2 years ago
dannyallover / overthinking_the_truth
View on GitHub
☆29Apr 30, 2024Updated 2 years ago
FSoft-AI4Code / VisualCoder
View on GitHub
[NAACL 2025] Guiding Large Language Models in Code Execution with Fine-grained Multimodal Chain-of-Thought Reasoning
☆10Feb 9, 2025Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
FranxYao / Retrieval-Head-with-Flash-Attention
View on GitHub
Efficient retrieval head analysis with triton flash attention that supports topK probability
☆13Jun 15, 2024Updated 2 years ago
eth-lre / LLM_ICL
View on GitHub
ACL24
☆11Jun 7, 2024Updated 2 years ago
HoraN1 / docker-ros-gui
View on GitHub
An image of ROS with Nvidia cudagl to enable GUI
☆14Apr 18, 2020Updated 6 years ago
mattf1n / basis-aware-threshold
View on GitHub
Code for the paper "Closing the Curious Case of Neural Text Degeneration"
☆12Apr 9, 2025Updated last year
Tiankai-Jiang / CFG-Generator
View on GitHub
Python Control Flow Graph Generator
☆19Feb 28, 2022Updated 4 years ago
notsebastiano / GP_algorithm
View on GitHub
Implementation of the Grassberger-Procaccia algorithm to estimate the Correlation Dimension of a set of points
☆16Jan 18, 2022Updated 4 years ago
techtee-ltd / TensorFlow_Threejs_FaceMesh
View on GitHub
This Repository has a project which explains how to generate a face point cloud in Three js using Tensorflow Js
☆23Jun 9, 2022Updated 4 years ago
hobinkwak / ExpectedGradients_IntegratedGradients_pytorch
View on GitHub
simple implementation of Expected Gradients and Integrated Gradients by pytorch
☆12May 11, 2022Updated 4 years ago
luka-group / CoIN
View on GitHub
☆14Jun 11, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
linxihui / dkernel
View on GitHub
☆22Apr 17, 2025Updated last year
aseec-lab / llms-for-code-analysis
View on GitHub
☆34Aug 28, 2024Updated last year
pariajm / english-fisher-annotations
View on GitHub
A recipe for constituency parsing, disfluency tagging and obtaining the fluent transcripts of English Fisher dataset
☆13May 2, 2021Updated 5 years ago
gdgcloudsantiago / gen-ai
View on GitHub
Recursos sobre Inteligencia Artificial Generativa
☆12Jul 18, 2023Updated 3 years ago
ttwthomas / nanogpt
View on GitHub
fork of karparthy's nanogpt with custom datasets
☆11Jul 25, 2023Updated 2 years ago
forwchen / LLaVA-MoLE
View on GitHub
☆10Mar 4, 2024Updated 2 years ago
fengnian123 / qwen-2.5-omni-realtime-chat
View on GitHub
使用fastrtc框架调用qwen-2.5-omni-realtime实现实时语音、视频等
☆14Jun 27, 2025Updated last year
hfutmars / MGCL
View on GitHub
The complete codes of the paper "Multimodal Graph Contrastive Learning for Recommendation"
☆10Mar 20, 2023Updated 3 years ago
Horizon2333 / videoqa_dataset_visualization
View on GitHub
Load and visualize different datasets in video question answering
☆10May 11, 2021Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Stacy027 / COIECD
View on GitHub
code for paper "Discerning and Resolving Knowledge Conflicts through Adaptive Decoding with Contextual Information-Entropy Constraint"
☆12Sep 29, 2024Updated last year
PKUFlyingPig / Awesome-System-for-Machine-Learning
View on GitHub
A curated list of research in machine learning system. I also summarize some papers if I think they are really interesting.
☆10Nov 6, 2021Updated 4 years ago
zphw / dns-cache-poisoning-demo
View on GitHub
An isolated environment for DNS cache poisoning attack investigation and demonstration.
☆10Nov 22, 2020Updated 5 years ago
LightR0 / hugging_face_tutorials
View on GitHub
☆18Apr 28, 2022Updated 4 years ago
nju-websoft / KnowLA
View on GitHub
KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation, NAACL 2024
☆16Jul 29, 2024Updated last year
Aofei-Chang / MedHEval
View on GitHub
Repo for preprint 2025 "MedHEval: Benchmarking Hallucinations and Mitigation Strategies in Medical Large Vision-Language Models"
☆16Apr 23, 2025Updated last year
GXimingLu / a_star_neurologic
View on GitHub
☆43Mar 24, 2023Updated 3 years ago
titu1994 / warprnnt_numba
View on GitHub
WarpRNNT loss ported in Numba CPU/CUDA for Pytorch
☆17Mar 11, 2022Updated 4 years ago
PanasonicConnect / InvReg
View on GitHub
Invariant Feature Regularization for Fair Face Recognition (ICCV'23)
☆15Oct 23, 2023Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
locuslab / massive-activations
View on GitHub
Code accompanying the paper "Massive Activations in Large Language Models"
☆201Mar 4, 2024Updated 2 years ago
Linxi000 / UniRec
View on GitHub
☆14Jul 17, 2024Updated 2 years ago
hdong920 / LESS
View on GitHub
☆53May 13, 2024Updated 2 years ago
zhu-minjun / SafetyLock
View on GitHub
Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!
☆11Oct 16, 2024Updated last year
UCSB-NLP-Chang / Fairness-Reprogramming
View on GitHub
☆16Oct 16, 2023Updated 2 years ago
pritamqu / OOD-VSSL
View on GitHub
[NeurIPS 2023 (Spotlight)] Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts
☆13Jan 30, 2024Updated 2 years ago
zrz1996 / Spam-Email-Classifier-DataSet
View on GitHub
Some simple codes to format the CSDMC2010 SPAM corpus
☆11Sep 18, 2016Updated 9 years ago