intel / neural-speed

An innovative library for efficient LLM inference via low-bit quantization
348 · Updated 9 months ago

Alternatives and similar repositories for neural-speed

Users who are interested in neural-speed compare it to the libraries listed below.
