MIRALab-USTC / LLM-AttentionPredictorLinks

The code for "AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference", Qingyue Yang, Jie Wang, Xing Li, Zhihai Wang, Chen Chen, Lei Chen, Xianzhi Yu, Wulong Liu, Jianye HAO, Mingxuan Yuan, Bin Li.
18Updated last month

Alternatives and similar repositories for LLM-AttentionPredictor

Users that are interested in LLM-AttentionPredictor are comparing it to the libraries listed below

Sorting: