tloen / llama-int8

Quantized inference code for LLaMA models
1,048 · Updated 2 years ago

Alternatives and similar repositories for llama-int8

Users who are interested in llama-int8 are comparing it to the libraries listed below.
