MIRALab-USTC / LLM-AttentionPredictorLinks

The code for "AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference", Qingyue Yang, Jie Wang, Xing Li, Zhihai Wang, Chen Chen, Lei Chen, Xianzhi Yu, Wulong Liu, Jianye HAO, Mingxuan Yuan, Bin Li.
17Updated this week

Alternatives and similar repositories for LLM-AttentionPredictor

Users that are interested in LLM-AttentionPredictor are comparing it to the libraries listed below

Sorting: