bjj / exllamav2-openai-server

An OpenAI API-compatible LLM inference server based on ExLlamaV2.
22 · Updated 9 months ago

Related projects

Alternatives and complementary repositories for exllamav2-openai-server