VectorInstitute / vector-inference

Efficient LLM inference on Slurm clusters using vLLM.
39Updated last week

Related projects

Alternatives and complementary repositories for vector-inference