intel / neural-speed

An innovative library for efficient LLM inference via low-bit quantization
351 · Updated 5 months ago

Alternatives and similar repositories for neural-speed:

Users who are interested in neural-speed are comparing it to the libraries listed below.