wangqinsi1 / CoreInfer

This is the official Python version of CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation.
β˜†15Updated 2 months ago

Alternatives and similar repositories for CoreInfer:

Users that are interested in CoreInfer are comparing it to the libraries listed below