DataXujing / TensorRT-LLM-ChatGLM3

大模型部署实战:TensorRT-LLM, Triton Inference Server, vLLM
26Updated 8 months ago

Related projects

Alternatives and complementary repositories for TensorRT-LLM-ChatGLM3