mit-han-lab / streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
6,749Updated 6 months ago

Alternatives and similar repositories for streaming-llm:

Users that are interested in streaming-llm are comparing it to the libraries listed below