xxyux / SpInfer

SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs
32 · Updated last week

Alternatives and similar repositories for SpInfer:

Users who are interested in SpInfer are comparing it to the libraries listed below.