SqueezeAILab / SqueezedAttention

SQUEEZED ATTENTION: Accelerating Long Prompt LLM Inference
36Updated 2 months ago

Alternatives and similar repositories for SqueezedAttention:

Users that are interested in SqueezedAttention are comparing it to the libraries listed below