SqueezeAILab / SqueezedAttention

SQUEEZED ATTENTION: Accelerating Long Prompt LLM Inference
29 · Updated 3 weeks ago

Alternatives and similar repositories for SqueezedAttention:

Users interested in SqueezedAttention are comparing it to the libraries listed below.