hcd233 / Aris-AI-Model-Server
An OpenAI-compatible API that integrates LLM, Embedding, and Reranker.
☆14Updated 2 months ago
Alternatives and similar repositories for Aris-AI-Model-Server
Users that are interested in Aris-AI-Model-Server are comparing it to the libraries listed below
- TextEmbed is a REST API crafted for high-throughput and low-latency embedding inference. It accommodates a wide variety of embedding mode…☆24Updated 9 months ago
- This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.☆60Updated 8 months ago
- Open Source Text Embedding Models with OpenAI Compatible API☆154Updated 11 months ago
- xllamacpp - a Python wrapper of llama.cpp☆44Updated last week
- Lightweight continuous batching with OpenAI compatibility using HuggingFace Transformers, including T5 and Whisper.☆24Updated 3 months ago
- Evaluation for AI apps and agents☆42Updated last year
- Deploy a lightweight, full OpenAI API for production with vLLM, supporting /v1/embeddings with all embedding models.☆42Updated 11 months ago
- LLM-related material, including code and docs☆13Updated 4 months ago
- Tutorials from AutoGen Basics to Use Cases☆31Updated last year
- Auto Thinking Mode switch for Qwen3 in Open WebUI☆65Updated last month
- SiliconCloud Cookbook☆22Updated 3 months ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆67Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆131Updated last year
- Accelerate vector generation using ONNX models☆17Updated last year
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated 4 months ago
- ☆19Updated last month
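- Acceleration for large-model inference frameworks: make LLMs fly☆18Updated last year
- GPT+ tool: a simple, practical one-stop AGI architecture with built-in localization, LLM models, agents, a vector database, and smart chains☆48Updated last year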
- 02. Enabling various applications to be AI-enabled or used by AI.☆28Updated 9 months ago
- Get up and running with Llama 3, Mistral, Gemma, and other large language models.☆27Updated last week
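- This project uses the PaddlePaddle platform to build an innovative multi-model collaborative system for efficient, accurate conversion of PDF files to Markdown files.☆16Updated 3 months ago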
- Sentence Transformers API: An OpenAI compatible embedding API server☆61Updated 9 months ago
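- TPO is a framework for optimizing LLM output text. It "fine-tunes the model" through iterative feedback and prompt optimization rather than by directly adjusting model parameters, aligning the model with human preferences during inference to produce better results. This project provides a friendly WebUI for loading models, optimizing a base model in real time, and displaying the best results.☆10Updated 4 months ago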
- Imitate OpenAI with Local Models☆86Updated 10 months ago
- A set of tools to create synthetically-generated data from documents☆18Updated last week
- A library integrating embedding and reranker models from OpenAI, SentenceTransformers etc for semantic search in vector database.☆45Updated 2 months ago
- Benchmarking the serving capabilities of vLLM☆46Updated 10 months ago
- MinerU API server☆62Updated 6 months ago
- aigc evals☆10Updated last year
- You can play with any API server that is compatible with the OpenAI API☆23Updated last year