ByteDance-Seed / FlexPrefillLinks

Code for paper: [ICLR2025 Oral] FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference
107Updated 2 weeks ago

Alternatives and similar repositories for FlexPrefill

Users that are interested in FlexPrefill are comparing it to the libraries listed below

Sorting: