mit-han-lab / streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
6,688Updated 4 months ago

Related projects

Alternatives and complementary repositories for streaming-llm