DataXujing / TensorRT-LLM-ChatGLM3

Hands-on large model deployment: TensorRT-LLM, Triton Inference Server, vLLM
26 · Updated last year

Alternatives and similar repositories for TensorRT-LLM-ChatGLM3:

Users who are interested in TensorRT-LLM-ChatGLM3 are comparing it to the libraries listed below.