kyegomez / MultiQueryAttention

This is a simple torch implementation of the high performance Multi-Query Attention
15Updated last year

Related projects

Alternatives and complementary repositories for MultiQueryAttention