xxyux / SpInfer

SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs
44Updated last month

Alternatives and similar repositories for SpInfer

Users that are interested in SpInfer are comparing it to the libraries listed below

Sorting: