MIRALab-USTC / LLM-AttentionPredictorLinks

The code for "AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference", Qingyue Yang, Jie Wang, Xing Li, Zhihai Wang, Chen Chen, Lei Chen, Xianzhi Yu, Wulong Liu, Jianye HAO, Mingxuan Yuan, Bin Li.
18Updated 2 weeks ago

Alternatives and similar repositories for LLM-AttentionPredictor

Users that are interested in LLM-AttentionPredictor are comparing it to the libraries listed below

Sorting: