xorbitsai / inference
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.
☆8,793 Updated this week
Alternatives and similar repositories for inference
Users who are interested in inference are comparing it to the libraries listed below.
- GLM-4 series: Open Multilingual Multimodal Chat LMs ☆6,937 Updated 4 months ago
- Retrieval and Retrieval-augmented LLMs ☆10,931 Updated last month
- Question and Answer based on Anything.