hcd233 / Aris-AI-Model-ServerLinks

An OpenAI Compatible API which integrates LLM, Embedding and Reranker. 一个集成 LLM、Embedding 和 Reranker 的 OpenAI 兼容 API

☆14

Alternatives and similar repositories for Aris-AI-Model-Server

Users that are interested in Aris-AI-Model-Server are comparing it to the libraries listed below

Sorting:

kevaldekivadiya2415 / textembed
TextEmbed is a REST API crafted for high-throughput and low-latency embedding inference. It accommodates a wide variety of embedding mode…
☆24Updated 9 months ago
NVIDIA / workbench-llamafactory
This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.
☆60Updated 8 months ago
limcheekin / open-text-embeddings
Open Source Text Embedding Models with OpenAI Compatible API
☆154Updated 11 months ago
xorbitsai / xllamacpp
xllamacpp - a Python wrapper of llama.cpp
☆44Updated last week
mesolitica / transformers-openai-api
Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.
☆24Updated 3 months ago
ninehills / langeval
Evaluation for AI apps and agent
☆42Updated last year
etalab-ia / albert-models
Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.
☆42Updated 11 months ago
habout632 / llms
llms related stuff , including code, docs
☆13Updated 4 months ago
sugarforever / AutoGen-Tutorials
Tutorials from AutoGen Basics to Use Cases
☆31Updated last year
AaronFeng753 / Better-Qwen3
Auto Thinking Mode switch for Qwen3 in Open webui
☆65Updated last month
siliconflow / siliconcloud-cookbook
SiliconCloud Cookbook
☆22Updated 3 months ago
asprenger / ray_vllm_inference
A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.
☆67Updated last year
chu-tianxiang / vllm-gptq
A high-throughput and memory-efficient inference and serving engine for LLMs
☆131Updated last year
amulil / vector_by_onnxmodel
accelerate generating vector by using onnx model
☆17Updated last year
LB-Young / Bambo
Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…
☆35Updated 4 months ago
yangyaofei / dify-vllm-provider
☆19Updated last month
zRzRzRzRzRzRzR / lm-fly
大模型推理框架加速，让 LLM 飞起来
☆18Updated last year
ziwang-com / mini-AGI
GPT+神器，简单实用的一站式AGI架构，内置本地化，LLM模型，agent，矢量数据库，智能链chain
☆48Updated last year
soulteary / dify-simple-rag-with-wp
02. Enabling various applications to be AI-enabled or used by AI.
☆28Updated 9 months ago
tc-mb / ollama
Get up and running with Llama 3, Mistral, Gemma, and other large language models.
☆27Updated last week
li-xiu-qi / x-pdf2md
本项目借助飞桨平台，构建起一套创新的多模型协同系统，实现 PDF 文件到 Markdown 文件的高效、精准转换。
☆16Updated 3 months ago
substratusai / stapi
Sentence Transformers API: An OpenAI compatible embedding API server
☆61Updated 9 months ago
Airmomo / tpo-llm-webui
TPO 是一个优化 LLM 输出文本的框架，通过迭代反馈和优化提示的方式来“微调模型”，而非直接调整模型的参数，使模型在推理过程中与人类偏好对齐以生成更好的结果。本项目提供了一个友好的 WebUI 来加载模型，实时优化基础模型并展示最佳结果。
☆10Updated 4 months ago
llm-factory / imitater
Imitate OpenAI with Local Models
☆86Updated 10 months ago
docling-project / docling-sdg
A set of tools to create synthetically-generated data from documents
☆18Updated last week
milvus-io / milvus-model
A library integrating embedding and reranker models from OpenAI, SentenceTransformers etc for semantic search in vector database.
☆45Updated 2 months ago
backprop-ai / vllm-benchmark
Benchmarking the serving capabilities of vLLM
☆46Updated 10 months ago
neka-nat / mineru-api
MinerU API server
☆62Updated 6 months ago
ssbuild / aigc_evals
aigc evals
☆10Updated last year
cyber-tao / openai_api_playground
You can play any API server that compatible with OpenAI API
☆23Updated last year