A plug-and-play tool for visualizing attention-score heatmap in generative LLMs. Easy to customize for your own need.
☆51May 16, 2024Updated last year
Alternatives and similar repositories for Attention-Viewer
Users that are interested in Attention-Viewer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for "Retaining Key Information under High Compression Rates: Query-Guided Compressor for LLMs" (ACL 2024)☆19Jun 12, 2024Updated last year
- ☆29Apr 30, 2024Updated last year
- ☆14Jul 6, 2025Updated 9 months ago
- Code for the paper "Closing the Curious Case of Neural Text Degeneration"☆12Apr 9, 2025Updated last year
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Jul 24, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆13Jun 11, 2024Updated last year
- ☆22Apr 17, 2025Updated 11 months ago
- Resa: Transparent Reasoning Models via SAEs☆48Sep 23, 2025Updated 6 months ago
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated last year
- fork of karparthy's nanogpt with custom datasets☆10Jul 25, 2023Updated 2 years ago
- KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation, NAACL 2024☆16Jul 29, 2024Updated last year
- [ 🎯 NeurIPS 2025 ] 3D-RAD 🩻: A Comprehensive 3D Radiology Med-VQA Dataset with Multi-Temporal Analysis and Diverse Diagnostic Tasks☆27Oct 28, 2025Updated 5 months ago
- Text generation using language models with multiple exit heads☆16Sep 18, 2025Updated 6 months ago
- 🌟Official code of our AAAI26 paper 🔍WebFilter☆38Nov 9, 2025Updated 5 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Load and visualize different datasets in video question answering☆10May 11, 2021Updated 4 years ago
- 基于 BPE 实现的中文分词。优化:预处理,并行计算,多字词,多词表☆14May 14, 2022Updated 3 years ago
- Source code of “Reinforcement Learning with Token-level Feedback for Controllable Text Generation (NAACL 2024)☆17Dec 8, 2024Updated last year
- code for paper "Discerning and Resolving Knowledge Conflicts through Adaptive Decoding with Contextual Information-Entropy Constraint"☆12Sep 29, 2024Updated last year
- An isolated environment for DNS cache poisoning attack investigation and demonstration.☆10Nov 22, 2020Updated 5 years ago
- [SIGIR'25] Code of "Generative Recommender with End-to-End Learnable Item Tokenization".☆26Apr 17, 2025Updated 11 months ago
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- The complete codes of the paper "Multimodal Graph Contrastive Learning for Recommendation"☆10Mar 20, 2023Updated 3 years ago
- Conversational Recommender System Evaluation via Simulation☆19Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- The implementation of paper "Strategy-aware Bundle Recommender System", SIGIR'23.☆15Sep 4, 2023Updated 2 years ago
- ☆56Oct 25, 2025Updated 5 months ago
- Code and data for Cell-o1.☆26Sep 19, 2025Updated 6 months ago
- ☆11Jul 28, 2023Updated 2 years ago
- ☆43Mar 24, 2023Updated 3 years ago
- 一个基于react16,react-router4,redux4的webapp。主要功能类似于朋友圈,动态编辑,图片上传,图片预览,点赞,评论,用户登录注册,用户日志管理,用户信息管理。服务等是采用express ,数据持久化采用的是mongodb。功能相对来说比较简单,主…☆10Apr 15, 2021Updated 4 years ago
- Invariant Feature Regularization for Fair Face Recognition (ICCV'23)☆15Oct 23, 2023Updated 2 years ago
- 使用django对情感分析功能进行封装,里面包含使用情感词典和Bert模型进行情感分类,最后可以使用tensorFlow serving将模型部署在docker中运行。☆12Sep 23, 2019Updated 6 years ago
- Code accompanying the paper "Massive Activations in Large Language Models"☆198Mar 4, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [KDD'25] Flow Matching for Collaborative Filtering☆22Sep 6, 2025Updated 7 months ago
- This repository is about how to implement your Intel realsense camera for skeleton tracking in Linux system.☆11Oct 22, 2020Updated 5 years ago
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!☆11Oct 16, 2024Updated last year
- 中文 Python 笔记☆12Jan 15, 2018Updated 8 years ago
- GUIPilot: A Consistency-based Mobile GUI Testing Approach for Detecting Application-specific Bugs☆14Jan 5, 2026Updated 3 months ago
- ☆17Mar 18, 2026Updated 3 weeks ago
- Codebase for running (conditional) probing experiments☆21Nov 13, 2022Updated 3 years ago