VectorInstitute / vector-inference

Efficient LLM inference on Slurm clusters using vLLM.
38Updated this week

Related projects

Alternatives and complementary repositories for vector-inference