microsoft / vattention

Dynamic Memory Management for Serving LLMs without PagedAttention
317 · Updated this week

Alternatives and similar repositories for vattention:

Users who are interested in vattention are comparing it to the libraries listed below.