modelize-ai / LLM-Inference-Deployment-Tutorial

Tutorial for LLM developers about engine design, service deployment, evaluation/benchmark, etc. Provide a C/S style optimized LLM inference engine.
19Updated last year

Related projects

Alternatives and complementary repositories for LLM-Inference-Deployment-Tutorial