kyegomez / MultiQueryAttention

A simple PyTorch implementation of high-performance Multi-Query Attention.
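For readers unfamiliar with the technique, here is a minimal sketch of multi-query attention in PyTorch: every query head gets its own projection, while a single key/value head is shared across all of them. This is not the repository's code. The class name `MultiQueryAttentionSketch` and its parameters `d_model` and `n_heads` are hypothetical, chosen only for illustration, and the sketch omits masking and dropout.

```python
import math

import torch
import torch.nn as nn


class MultiQueryAttentionSketch(nn.Module):
    """Hypothetical illustration, not the repository's API."""

    def __init__(self, d_model: int, n_heads: int):
        super().__init__()
        assert d_model % n_heads == 0, "d_model must be divisible by n_heads"
        self.n_heads = n_heads
        self.d_head = d_model // n_heads
        # Queries keep one projection per head (packed into a single Linear).
        self.q_proj = nn.Linear(d_model, d_model)
        # Keys and values share ONE head, so the projection is only 2 * d_head wide.
        self.kv_proj = nn.Linear(d_model, 2 * self.d_head)
        self.out_proj = nn.Linear(d_model, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, t, _ = x.shape
        # (b, t, d_model) -> (b, n_heads, t, d_head)
        q = self.q_proj(x).view(b, t, self.n_heads, self.d_head).transpose(1, 2)
        # Split the shared projection into key and value, each (b, t, d_head).
        k, v = self.kv_proj(x).split(self.d_head, dim=-1)
        # Add a singleton head dimension so k and v broadcast over all query heads.
        k = k.unsqueeze(1)
        v = v.unsqueeze(1)
        # Scaled dot-product scores: (b, n_heads, t, t).
        scores = q @ k.transpose(-2, -1) / math.sqrt(self.d_head)
        weights = scores.softmax(dim=-1)
        # Weighted sum of the shared values: (b, n_heads, t, d_head).
        out = weights @ v
        # Merge heads back to (b, t, d_model) and apply the output projection.
        out = out.transpose(1, 2).reshape(b, t, self.n_heads * self.d_head)
        return self.out_proj(out)


if __name__ == "__main__":
    layer = MultiQueryAttentionSketch(d_model=32, n_heads=4)
    y = layer(torch.randn(2, 5, 32))
    print(y.shape)
```

Because keys and values are projected to a single head, the key/value cache kept during autoregressive decoding is `n_heads` times smaller than in standard multi-head attention, which is the main source of the speedup.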

Related projects: