DataXujing / TensorRT-LLM-ChatGLM3

Large model deployment in practice: TensorRT-LLM, Triton Inference Server, vLLM
26 · Updated 11 months ago

Alternatives and similar repositories for TensorRT-LLM-ChatGLM3:

Users who are interested in TensorRT-LLM-ChatGLM3 are comparing it to the libraries listed below.