xxyux / SpInfer

SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs
42 · Updated last month

Alternatives and similar repositories for SpInfer:

Users who are interested in SpInfer are comparing it to the libraries listed below.