SqueezeAILab / SqueezedAttentionView on GitHub
[ACL 2025] Squeezed Attention: Accelerating Long Prompt LLM Inference
58Nov 20, 2024Updated last year

Alternatives and similar repositories for SqueezedAttention

Users that are interested in SqueezedAttention are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?