mit-han-lab / streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
6,848 · Updated 9 months ago

Alternatives and similar repositories for streaming-llm:

Users who are interested in streaming-llm compare it to the libraries listed below.