jorahn / llama-int8

Quantized inference code for LLaMA models
13Updated last year

Related projects

Alternatives and complementary repositories for llama-int8