intel / neural-speed
An innovative library for efficient LLM inference via low-bit quantization
352 · Aug 30, 2024 · Updated last year

Alternatives and similar repositories for neural-speed

Users who are interested in neural-speed are comparing it to the libraries listed below.
