Qcompiler / MixQ_Tensorrt_LLMView on GitHub
Mixed precision inference by Tensorrt-LLM
81Oct 23, 2024Updated last year

Alternatives and similar repositories for MixQ_Tensorrt_LLM

Users that are interested in MixQ_Tensorrt_LLM are comparing it to the libraries listed below

Sorting:

Are these results useful?