A plug-and-play tool for visualizing attention-score heatmap in generative LLMs. Easy to customize for your own need.
☆52May 16, 2024Updated 2 years ago
Alternatives and similar repositories for Attention-Viewer
Users that are interested in Attention-Viewer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)☆67Sep 28, 2024Updated last year
- Code for "Retaining Key Information under High Compression Rates: Query-Guided Compressor for LLMs" (ACL 2024)☆19Jun 12, 2024Updated 2 years ago
- ☆29Apr 30, 2024Updated 2 years ago
- Efficient retrieval head analysis with triton flash attention that supports topK probability☆13Jun 15, 2024Updated 2 years ago
- ACL24☆11Jun 7, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Jul 24, 2024Updated last year
- ☆14Jun 11, 2024Updated 2 years ago
- simple implementation of Expected Gradients and Integrated Gradients by pytorch☆12May 11, 2022Updated 4 years ago
- ☆22Apr 17, 2025Updated last year
- ☆10Mar 4, 2024Updated 2 years ago
- KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation, NAACL 2024☆16Jul 29, 2024Updated last year
- [ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks☆32Jun 22, 2026Updated last week
- Text generation using language models with multiple exit heads☆16Sep 18, 2025Updated 9 months ago
- 使用fastrtc框架调用qwen-2.5-omni-realtime实现实时语音、视频等☆14Jun 27, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆13Mar 27, 2025Updated last year
- 🌟Official code of our AAAI26 paper 🔍WebFilter☆39Nov 9, 2025Updated 7 months ago
- A curated list of research in machine learning system. I also summarize some papers if I think they are really interesting.☆10Nov 6, 2021Updated 4 years ago
- Source code of “Reinforcement Learning with Token-level Feedback for Controllable Text Generation (NAACL 2024)☆17Dec 8, 2024Updated last year
- code for paper "Discerning and Resolving Knowledge Conflicts through Adaptive Decoding with Contextual Information-Entropy Constraint"☆11Sep 29, 2024Updated last year
- ☆15Apr 22, 2024Updated 2 years ago
- ☆15Mar 30, 2024Updated 2 years ago
- ☆17Apr 29, 2025Updated last year
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Conversational Recommender System Evaluation via Simulation☆21Jun 23, 2026Updated last week
- Repo for preprint 2025 "MedHEval: Benchmarking Hallucinations and Mitigation Strategies in Medical Large Vision-Language Models"☆14Apr 23, 2025Updated last year
- ☆11Jul 28, 2023Updated 2 years ago
- QuantClaw is a plug-and-play task-type routing quantization plugin for OpenClaw.☆115Apr 27, 2026Updated 2 months ago
- 一个基于react16,react-router4,redux4的webapp。主要功能类似于朋友圈,动态编辑,图片上传,图片预览,点赞,评论,用户登录注册,用户日志管理,用户信息管理。服务等是采用express ,数据持久化采用的是mongodb。功能相对来说比较简单,主…☆10Apr 15, 2021Updated 5 years ago
- Invariant Feature Regularization for Fair Face Recognition (ICCV'23)☆15Oct 23, 2023Updated 2 years ago
- Evaluation utilities based on SymPy.☆22Dec 12, 2024Updated last year
- ☆14Jul 17, 2024Updated last year
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆17Mar 11, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 使用django对情感分析功能进行封装,里面包含使用情感词典和Bert模型进行情感分类,最后可以使用tensorFlow serving将模型部署在docker中运行。☆12Sep 23, 2019Updated 6 years ago
- [NeurIPS 2023 (Spotlight)] Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts☆13Jan 30, 2024Updated 2 years ago
- ☆53May 13, 2024Updated 2 years ago
- [KDD'25] Flow Matching for Collaborative Filtering☆24Sep 6, 2025Updated 9 months ago
- Awesome papers for affective computing with llm and mllm☆30Nov 26, 2025Updated 7 months ago
- ☆17Mar 18, 2026Updated 3 months ago
- PyTorch implementation for "Probabilistic Circuits for Variational Inference in Discrete Graphical Models", NeurIPS 2020☆17Oct 11, 2021Updated 4 years ago