ChenMnZ / PrefixQuant

An algorithm for static activation quantization of LLMs
67 · Updated this week

Related projects

Alternatives and complementary repositories for PrefixQuant