DeepAuto-AI / hip-attention

Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.
19Updated 2 weeks ago

Related projects

Alternatives and complementary repositories for hip-attention