NVIDIA / trt-llm-as-openai-windows

This reference can be used with any existing OpenAI integrated apps to run with TRT-LLM inference locally on GeForce GPU on Windows instead of cloud.
116Updated 8 months ago

Related projects

Alternatives and complementary repositories for trt-llm-as-openai-windows