usyd-fsalab / fp6_llmLinks

An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).
251Updated 7 months ago

Alternatives and similar repositories for fp6_llm

Users that are interested in fp6_llm are comparing it to the libraries listed below

Sorting: